Strategic  Delay  and  Information  Exchange  in 
Endogenous  Social  Networks 

by 

Kostas  Bimpikis 

B.Sc.,  National  Technical  University  of  Athens,  Greece  (2003) 
M.Sc.,  University  of  California,  San  Diego  (2005) 


■  massachu  v:riv'  institute 

Or-  T":C'  ''  .  ■  '  V 


Submitted  to  the  Sloan  School  of  Management 
in  partial  fulfillment  of  the  requirements  for  the  degree  of 

Doctor  of  Philosophy  in  Operations  Research 

at  the 

MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY 

September  2010 

©  Massachusetts  Institute  of  Technology  2010.  All  rights  reserved. 


ARCHIVES 


Author . ‘ 

,  ~  A 

.  .  Ac* 


Certified  by 


Certified  by . 


\ 


Sloan  School  of  Management 
August  11,  2010 


Daron  Acemoglu 
Professor 
Thesis  Supervisor 


Asuman  Ozdaglar 
Associate  Professor 
Thesis  Supervisor 


Accepted  by 


v  •  * 


Dimitris  Bertsimas 
Co-director,  Operations  Research  Center 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

SEP  2010  2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-2010  to  00-00-2010 

4.  TITLE  AND  SUBTITLE 

Strategic  Delay  and  Information  Exchange  in  Endogenous  Social 

Networks 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Massachusetts  Institute  of  Technology, Cambridge, MA, 02139 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 


14.  ABSTRACT 

This  thesis  studies  optimal  stopping  problems  for  strategic  agents  in  the  context  of  two  economic 
applications:  experimentation  in  a  competitive  market  and  information  exchange  in  social  networks.  The 
economic  agents  (firms  in  the  first  application,  individuals  in  the  second)  take  actions,  whose  payoffs 
depend  on  an  unknown  underlying  state.  Our  framework  is  characterized  by  the  following  key  feature: 
agents  time  their  actions  to  take  advantage  of  either  the  outcome  of  the  actions  of  others  (experimentation 
model)  or  information  obtained  over  time  by  their  peers  (information  exchange  model).  Equilibria  in  both 
environments  are  typically  inefficient,  since  information  is  imperfect  and,  thus,  there  is  a  benefit  in  being  a 
late  mover,  but  delaying  is  costly.  More  specifically,  in  the  first  part  of  the  thesis,  we  develop  a  model  of 
experimentation  and  innovation  in  a  competitive  multi-firm  environment.  Each  firm  receives  a  private 
signal  on  the  success  probability  of  a  research  project  and  decides  when  and  which  project  to  implement.  A 
successful  innovation  can  be  copied  by  other  firms.  We  start  the  analysis  by  considering  the  symmetric 
environment,  where  the  signal  quality  is  the  same  for  all  firms.  Symmetric  equilibria  (where  actions  do  not 
depend  on  the  identity  of  the  firm)  always  involve  delayed  and  staggered  experimentation,  whereas  the 
optimal  allocation  never  involves  delays  and  may  involve  simultaneous  rather  than  staggered 
experimentation.  The  social  cost  of  insufficient  experimentation  can  be  arbitrarily  large.  Then  we  study  the 
role  of  simple  instruments  in  improving  over  equilibrium  outcomes.  We  show  that  appropriately-designed 
patents  can  implement  the  socially  optimal  allocation  (in  all  equilibria)  by  encouraging  rapid 
experimentation  and  efficient  ex  post  transfer  of  knowledge  across  firms.  In  contrast  to  patents,  subsidies 
to  experimentation,  research,  or  innovation  cannot  typically  achieve  this  objective.  We  also  discuss  the  case 
when  signal  quality  is  private  information  and  differs  across  firms.  We  show  that  in  this  more  general 
environment  patents  again  encourage  experimentation  and  reduce  delays.  In  the  second  part,  we  study  a 
model  of  information  exchange  among  rational  individuals  through  communication  and  investigate  its 
implications  for  information  aggregation  in  large  societies.  An  underlying  state  (of  the  world)  determines 
which  action  has  higher  payoff.  Agents  receive  a  private  signal  correlated  with  the  underlying  state.  They 
then  exchange  information  over  their  social  network  until  taking  an  (irreversible)  action.  We  define 
asymptotic  learning  as  the  fraction  of  agents  taking  an  action  that  is  close  to  optimal  converging  to  one  in 
probability  as  a  society  grows  large.  Under  truthful  communication 


15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

Same  as 
Report  (SAR) 

18.  NUMBER 
OF  PAGES 

165 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


2 


Strategic  Delay  and  Information  Exchange  in  Endogenous  Social 

Networks 

by 

Kostas  Bimpikis 


Submitted  to  the  Sloan  School  of  Management 
on  August  11,  2010,  in  partial  fulfillment  of  the 
requirements  for  the  degree  of 
Doctor  of  Philosophy  in  Operations  Research 


Abstract 

This  thesis  studies  optimal  stopping  problems  for  strategic  agents  in  the  context  of 
two  economic  applications:  experimentation  in  a  competitive  market  and  information  ex¬ 
change  in  social  networks.  The  economic  agents  (firms  in  the  first  application,  individuals 
in  the  second)  take  actions,  whose  payoffs  depend  on  an  unknown  underlying  state.  Our 
framework  is  characterized  by  the  following  key  feature:  agents  time  their  actions  to  take 
advantage  of  either  the  outcome  of  the  actions  of  others  (experimentation  model)  or  in¬ 
formation  obtained  over  time  by  their  peers  (information  exchange  model).  Equilibria  in 
both  environments  are  typically  inefficient,  since  information  is  imperfect  and,  thus,  there 
is  a  benefit  in  being  a  late  mover,  but  delaying  is  costly. 

More  specifically,  in  the  first  part  of  the  thesis,  we  develop  a  model  of  experimenta¬ 
tion  and  innovation  in  a  competitive  multi-firm  environment.  Each  firm  receives  a  pri¬ 
vate  signal  on  the  success  probability  of  a  research  project  and  decides  when  and  which 
project  to  implement.  A  successful  innovation  can  be  copied  by  other  firms.  We  start 
the  analysis  by  considering  the  symmetric  environment,  where  the  signal  quality  is  the 
same  for  all  firms.  Symmetric  equilibria  (where  actions  do  not  depend  on  the  identity 
of  the  firm)  always  involve  delayed  and  staggered  experimentation,  whereas  the  optimal 
allocation  never  involves  delays  and  may  involve  simultaneous  rather  than  staggered  exper¬ 
imentation.  The  social  cost  of  insufficient  experimentation  can  be  arbitrarily  large.  Then, 
we  study  the  role  of  simple  instruments  in  improving  over  equilibrium  outcomes.  We  show 
that  appropriately-designed  patents  can  implement  the  socially  optimal  allocation  (in  all 
equilibria)  by  encouraging  rapid  experimentation  and  efficient  ex  post  transfer  of  knowledge 
across  firms.  In  contrast  to  patents,  subsidies  to  experimentation,  research,  or  innovation 
cannot  typically  achieve  this  objective.  We  also  discuss  the  case  when  signal  quality  is  pri¬ 
vate  information  and  differs  across  firms.  We  show  that  in  this  more  general  environment 
patents  again  encourage  experimentation  and  reduce  delays. 

In  the  second  part,  we  study  a  model  of  information  exchange  among  rational  individ¬ 
uals  through  communication  and  investigate  its  implications  for  information  aggregation 
in  large  societies.  An  underlying  state  (of  the  world)  determines  which  action  has  higher 
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payoff.  Agents  receive  a  private  signal  correlated  with  the  underlying  state.  They  then 
exchange  information  over  their  social  network  until  taking  an  (irreversible)  action.  We 
define  asymptotic  learning  as  the  fraction  of  agents  taking  an  action  that  is  close  to  optimal 
converging  to  one  in  probability  as  a  society  grows  large.  Under  truthful  communication, 
we  show  that  asymptotic  learning  occurs  if  (and  under  some  additional  conditions,  also  only 
if)  in  the  social  network  most  agents  are  a  short  distance  away  from  “information  hubs”, 
which  receive  and  distribute  a  large  amount  of  information.  Asymptotic  learning  therefore 
requires  information  to  be  aggregated  in  the  hands  of  a  few  agents.  We  also  show  that  while 
truthful  communication  is  not  always  optimal,  when  the  communication  network  induces 
asymptotic  learning  (in  a  large  society),  truthful  communication  is  an  equilibrium.  Then, 
we  discuss  the  welfare  implications  of  equilibrium  behavior.  In  particular,  we  compare  the 
aggregate  welfare  at  equilibrium  with  that  of  the  optimal  allocation,  which  is  defined  as  the 
strategy  profile  a  social  planner  would  choose,  so  as  to  maximize  the  expected  aggregate 
welfare.  We  show  that  when  asymptotic  learning  occurs  all  equilibria  are  efficient.  A  partial 
converse  is  also  true:  if  asymptotic  learning  does  not  occur  at  the  optimal  allocation  and  an 
additional  mild  condition  holds  at  an  equilibrium,  then  the  equilibrium  is  inefficient.  Fur¬ 
thermore,  we  discuss  how  our  learning  results  can  be  applied  to  several  commonly  studied 
random  graph  models,  such  as  preferential  attachment  and  Erdos-Renyi  graphs. 

In  the  final  part,  we  study  strategic  network  formation  in  the  context  of  information 
exchange.  In  particular,  we  relax  the  assumption  that  the  social  network  over  which  agents 
communicate  is  fixed,  and  we  let  agents  decide  which  agents  to  form  a  communication  link 
with  incurring  an  associated  cost.  We  provide  a  systematic  investigation  of  what  types 
of  cost  structures  and  associated  social  cliques  (consisting  of  groups  of  individuals  linked 
to  each  other  at  zero  cost,  such  as  friendship  networks)  ensure  the  emergence  of  commu¬ 
nication  networks  that  lead  to  asymptotic  learning.  Our  result  shows  that  societies  with 
too  many  and  sufficiently  large  social  cliques  do  not  induce  asymptotic  learning,  because 
each  social  clique  would  have  sufficient  information  by  itself,  making  communication  with 
others  relatively  unattractive.  Asymptotic  learning  results  if  social  cliques  are  neither  too 
numerous  nor  too  large,  in  which  case  communication  across  cliques  is  encouraged. 
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Chapter  1 


Introduction 


Optimal  stopping  problems  have  been  considered  extensively  in  the  literature.  They  typi¬ 
cally  involve  a  single  agent  or  a  group  of  agents  that  are  faced  with  the  problem  of  deciding 
when  to  take  an  action,  so  as  to  maximize  their  utility.  The  decision  problem  is  non-trivial 
due  to  the  following  tradeoff:  delaying  leads  to  a  “better”  action,  however,  future  is  dis¬ 
counted  and,  thus,  agents  cannot  wait  forever.  Moreover,  the  optimal  stopping  time  for  an 
agent  depends  in  many  cases  crucially  on  the  action  profiles  of  her  peers,  thus  necessitat¬ 
ing  incorporating  strategic  considerations  into  the  framework  and  treating  it  as  a  dynamic 
game. 

In  this  thesis,  we  formulate  optimal  stopping  games  or  waiting  games  (see  [22])  to  study 
two  seemingly  unrelated  economic  applications:  (1)  investment  in  R&D  in  the  presence  of 
competitors,  that  may  choose  to  free-ride  on  one’s  experimentation  efforts  and  (2)  informa¬ 
tion  exchange  among  rational  individuals,  that  are  embedded  in  a  social  network  structure. 
We  provide  a  characterization  of  the  equilibria  in  both  settings  and  discuss  conditions  un¬ 
der  which  they  are  inefficient,  i.e. ,  there  exist  allocations,  that  achieve  a  higher  expected 
aggregate  welfare. 

Alongside  with  the  characterization  of  equilibria  and  discussion  of  welfare  loss,  we  pro¬ 
vide  a  number  of  results  that  are  specific  to  each  of  the  two  applications.  In  particular, 
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in  the  first  model  (investment  in  R&D)  we  investigate  the  role  of  simple  policies  for  im¬ 
proving  the  aggregate  welfare.  We  show  that  although  subsidies  from  third  parties  fail  to 
restore  the  distorted  incentives,  simple  patents  may  succeed  (under  certain  conditions)  in 
both  dealing  with  costly  delay  and  allowing  firms  to  share  their  knowledge.  In  the  second 
environment,  we  are  interested  in  whether  agents  eventually  learn ,  i.e.,  choose  an  action, 
that  is  close  to  the  optimal,  and  when  both  objectives  (learning  and  maximizing  welfare) 
coincide. 

Roughly  the  thesis  is  decomposed  into  three  parts:  the  first  (Chapters  2-4)  discusses  our 
model  of  experimentation  and  innovation,  the  second  (Chapters  5-7)  studies  information 
exchange  in  already  formed  social  networks  and,  finally,  the  last  part  (Chapter  8)  deals 
with  the  complementary  problem  of  network  formation:  given  that  agents  are  interested  in 
acquiring  as  much  information  as  early  as  possible,  what  kinds  of  communication  networks 
arise  and  what  are  their  learning  properties?  The  following  two  sections  describe  our  models 
in  a  high  level  and  discuss  relevant  literature.  Finally,  in  Section  1.3  we  provide  a  more 
detailed  roadmap  of  the  thesis. 

1.1  Strategic  delay  in  a  model  of  experimentation 

Most  modern  societies  provide  intellectual  property  rights  protection  to  innovators  using 
a  patent  system.  The  main  argument  in  favor  of  patents  is  that  they  encourage  ex  ante 
innovation  by  creating  ex  post  monopoly  rents  (e.g.,  [4],  [7],  [68],  [34],  [35],  [39],  [53],  [54], 
[58],  [59],  [60]).  In  the  first  part  of  the  thesis,  we  suggest  an  alternative  (and  complementary) 
social  benefit  to  patents.  We  show  that,  under  certain  circumstances,  patents  encourage 
experimentation  by  potential  innovators  while  still  allowing  socially  beneficial  transmission 
of  knowledge  across  firms. 

We  construct  a  stylized  model  of  experimentation  and  innovation.  In  our  baseline  game, 
N  symmetric  potential  innovators  (firms)  have  each  access  to  a  distinct  research  opportu¬ 
nity  and  a  private  signal  on  how  likely  it  is  to  result  in  a  successful  innovation.  Firms  can 


14 


decide  to  experiment  at  any  point  in  time.  A  successful  innovation  is  publicly  observed 
and  can  be  copied  by  any  of  the  other  potential  innovators  (for  example,  other  firms  can 
build  on  the  knowledge  revealed  by  the  innovation  in  order  to  increase  their  own  proba¬ 
bility  of  success,  but  in  the  process  capture  some  of  the  rents  of  this  first  innovator).  The 
returns  from  the  implementation  of  a  successful  innovation  are  nonincreasing  in  the  number 
of  firms  implementing  it.  We  provide  an  explicit  characterization  of  the  equilibria  of  this 
dynamic  game.  The  symmetric  equilibrium  always  features  delayed  and  staggered  exper¬ 
imentation.  In  particular,  experimentation  does  not  take  place  immediately  and  involves 
one  firm  experimenting  before  others  (and  the  latter  firms  free-riding  on  the  former’s  ex¬ 
perimentation).  In  contrast,  the  optimal  allocation  never  involves  delays  and  may  require 
simultaneous  rather  than  staggered  experimentation.  The  insufficient  equilibrium  incen¬ 
tives  for  experimentation  may  create  a  significant  efficiency  loss:  the  ratio  of  social  surplus 
generated  by  the  equilibrium  relative  to  the  optimal  allocation  can  be  arbitrarily  small. 

We  next  show  that  a  simple  arrangement  resembling  a  patent  system,  where  a  copying 
firm  has  to  make  a  prespecifed  payment  to  the  innovator,  can  implement  the  optimal  allo¬ 
cation  (we  refer  to  this  arrangement  as  a  “patent  system”).  When  the  optimal  allocation 
involves  simultaneous  experimentation,  the  patent  system  makes  free-riding  prohibitively 
costly  and  implements  the  optimal  allocation  as  the  unique  equilibrium.  When  the  optimal 
allocation  involves  staggered  experimentation,  the  patent  system  plays  a  more  subtle  role. 
It  permits  ex  post  transmission  of  knowledge  but  still  increases  experimentation  incentives 
to  avoid  delays.  The  patent  system  can  achieve  this  because  it  generates  “conditional” 
transfers.  An  innovator  receives  a  patent  payment  only  when  copied  by  other  firms.  Con¬ 
sequently,  patents  encourage  one  firm  to  experiment  earlier  than  others,  thus  achieving 
rapid  experimentation  without  sacrificing  useful  transfer  of  knowledge.  Moreover,  we  show 
that  patents  can  achieve  this  outcome  in  all  equilibria.  The  fact  that  patents  are  par¬ 
ticularly  well  designed  to  play  this  role  is  also  highlighted  by  our  result  that  while  an 
appropriately-designed  patent  implements  the  optimal  allocation  in  all  equilibria,  subsidies 
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to  experimentation,  research,  or  innovation  cannot  achieve  the  same  objective. 

In  our  baseline  model,  both  the  optimal  allocation  and  the  symmetric  equilibrium  in¬ 
volve  sequential  experimentation.  Inefficiency  results  from  lack  of  sufficient  experimenta¬ 
tion  or  from  delays.  The  structure  of  equilibria  is  richer  when  the  strength  (quality)  of  the 
signals  received  by  potential  innovators  differs  and  is  also  private  information.  In  this  case, 
those  with  sufficiently  strong  signals  will  prefer  not  to  copy  successful  innovations.  We 
show  that  in  this  more  general  environment  patents  are  again  potentially  useful  (though 
they  cannot  typically  achieve  the  optimal  allocation). 

Although  our  analysis  is  purely  theoretical,  we  believe  that  the  insights  it  generates,  in 
particular  regarding  the  role  of  patents  in  discouraging  delay  in  research  and  encouraging 
experimentation,  are  relevant  for  thinking  about  the  role  of  the  patent  systems  in  practice. 
Two  assumptions  are  important  for  our  results.  The  first  is  that  pursuing  an  unsuccessful 
research  line  makes  a  subsequent  switch  to  an  alternative  line  more  costly.  We  impose  this 
feature  in  the  simplest  possible  way,  assuming  that  such  a  switch  is  not  possible,  though 
our  qualitative  results  would  not  be  affected  if  switching  were  feasible  but  costly.  We  view 
this  assumption  as  a  reasonable  approximation  to  reality.  Commitment  of  intellectual  and 
financial  resources  to  a  specific  research  line  or  vision  is  necessary  for  success,  and  once 
such  commitment  has  been  made,  changing  course  is  not  easy.1  Our  second  key  assumption 
is  that  copying  of  successful  projects  is  possible  (without  prohibitive  patents)  and  reduces 
the  returns  to  original  innovators.  This  assumption  also  appears  quite  plausible.  Copying 
of  a  successful  project  here  should  be  interpreted  more  broadly  as  using  the  information 


1For  example,  in  the  computer  industry,  firms  such  as  Digital  Equipment  Corporation  (DEC)  that 
specialized  in  mainframes  found  it  difficult  to  make  a  successful  switch  to  personal  computers  (e.g.,  [26]). 
Similarly,  early  innovators  in  the  cell  phone  industry,  such  as  Nokia  and  Ericsson,  appear  to  be  slow 
in  switching  to  the  new  generation  of  more  advanced  wireless  devices  and  smartphones,  and  have  been 
generally  lagging  behind  companies  such  as  Apple  and  RIM.  Another  interesting  example  comes  from  the 
satellite  launches.  The  early  technology  choice  for  launching  satellites  into  space  relied  on  large  ground- 
based  rockets;  despite  evidence  that  using  smaller  rockets  and  carrying  these  to  the  upper  atmosphere 
using  existing  aerospace  equipment  would  be  considerably  cheaper  and  more  flexible,  organizations  such 
as  NASA  have  not  switched  to  this  new  technology,  while  private  space  technology  companies  have  (see 
[43]). 
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revealed  by  successful  innovation  or  experimentation,  so  it  does  not  need  to  correspond  to 
replicating  the  exact  same  innovation  (or  product),2  and  naturally  such  copying  will  have 
some  negative  effect  on  the  returns  of  the  original  innovator. 

In  addition  to  the  literature  on  patents  mentioned  above,  a  number  of  other  works  are 
related  to  the  model  presented  here.  First,  ours  is  a  simple  model  of  (social)  experimen¬ 
tation  and  shares  a  number  of  common  features  with  recent  work  in  this  area  (e.g.,  [17] 
and  [52]).  These  papers  characterize  equilibria  of  multi-agent  two- armed  bandit  problems 
and  show  that  there  may  be  insufficient  experimentation.  The  structure  of  the  equilibrium 
is  particularly  simple  in  our  model  and  can  be  characterized  explicitly  because  all  payoff¬ 
relevant  uncertainty  is  revealed  after  a  single  successful  experimentation.  In  addition,  as 
discussed  above,  there  is  insufficient  experimentation  in  our  model  as  well,  though  this 
also  takes  a  simple  form:  either  there  is  free-riding  by  some  firms  reducing  the  amount 
of  experimentation  or  experimentation  is  delayed.  We  also  show  that  patent  systems  can 
increase  experimentation  incentives  and  implement  the  optimal  allocation. 

Second,  the  structure  of  equilibria  with  symmetric  firms  is  reminiscent  to  equilibria 
in  war  of  attrition  games  (e.g.,  [42],  [46],  [64]).  War  of  attrition  games  have  been  used 
in  various  application  domains,  such  as  the  study  of  market  exit  ([18]  and  [29]),  research 
tournaments  ([67]),  auctions  ([19]),  investment  choices  ([20]),  exploratory  drilling  ([44]  and 
[45])  and  the  diffusion  of  new  technologies  ([51]).  Similar  in  spirit  with  our  work,  [22] 
discusses  waiting  games  of  technological  change,  in  which  there  is  a  late-mover  advantage 
due  to  knowledge  spillovers.  In  our  symmetric  model,  as  in  symmetric  wars  of  attrition, 
players  choose  the  stochastic  timing  of  their  actions  in  such  a  way  as  to  make  other  players 
indifferent  and  willing  to  mix  over  the  timing  of  their  own  actions.  The  structure  of 
equilibria  and  the  optimal  allocation  is  different,  however,  and  the  optimal  allocation  may 
involve  either  simultaneous  experimentation  by  all  players  or  staggered  experimentation 

2In  terms  of  the  examples  in  footnote  1,  while  DEC,  Nokia  and  Ericsson  may  have  been  slow  in  adopting 
new  technologies,  several  other,  new  companies  have  built  on  the  technological  advances  that  took  place 
in  personal  computers  and  smartphones. 
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similar  to  that  resulting  in  asymmetric  equilibria.  The  novel  beneficial  role  of  patents  in 
our  model  arises  from  their  ability  to  implement  such  asymmetric  equilibria. 

Finally,  the  monotonicity  property  when  the  quality  of  signals  differs  across  agents  is 
similar  to  results  in  generalized  wars  of  attrition  (e.g.,  [18],  [19]  and  [29])  and  is  also  related 
to  [40]  result  on  the  clustering  of  actions  in  herding  models.  In  the  context  of  a  standard 
herding  model  with  endogenous  timing,  Gul  and  Lundholm  construct  an  equilibrium  in 
which  agents  with  stronger  signals  act  earlier  than  those  with  weaker  signals,  though  the 
specifics  of  our  model  and  analysis  differs  from  these  previous  contributions.  3 

1.2  Information  exchange  in  endogenous  social  net¬ 
works 

Most  social  decisions,  ranging  from  product  and  occupational  choices  to  voting  and 
political  behavior,  rely  on  information  agents  gather  through  communication  with  friends, 
neighbors  and  co-workers  as  well  as  information  obtained  from  news  sources  and  prominent 
webpages.  A  central  question  in  social  sciences  concerns  the  dynamics  of  communication 
and  information  exchange  and  whether  such  dynamics  lead  to  the  effective  aggregation  of 
dispersed  information  that  exists  in  a  society.  In  the  second  part  of  the  thesis,  we  construct 
a  dynamic  model  to  investigate  this  question.  If  information  exchanges  were  non-strategic, 
timeless  and  costless,  all  information  could  be  aggregated  immediately  by  simultaneous 
communication  across  all  agents.  Thus,  the  key  ingredient  of  our  approach  is  dynamic  and 
costly  communication. 

Our  benchmark  model  features  an  underlying  state  of  the  world  that  determines  which 
action  has  higher  payoff  (which  is  the  same  for  all  agents).  Because  of  discounting,  ear¬ 
lier  actions  are  preferred  to  later  ones.  Each  agent  receives  a  private  signal  correlated 
with  this  underlying  state.  In  addition,  agents  communicate  with  those  others  with  whom 

3This  monotonicity  property  does  not  hold  in  our  model  when  there  are  more  than  two  firms  and 
the  support  of  signals  includes  sufficiently  strong  signals  so  that  some  firms  prefer  not  to  copy  successful 
experimentations  as  is  shown  in  Appendix  B. 
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they  are  connected  in  their  social  network  until  they  take  an  irreversible  action.  Crucially, 
information  acquisition  takes  time  because  the  “neighbors’  of  an  agent  with  whom  she 
communicates  acquire  more  information  from  their  own  neighbors  over  time.  Information 
exchange  will  thus  be  endogenously  limited  by  two  features:  the  communication  network 
,  which  allows  communication  only  between  connected  pairs,  and  discounting,  which  en¬ 
courages  agents  to  take  actions  before  they  accumulate  sufficient  information. 

We  characterize  the  equilibria  of  this  communication  game  and  then  investigate  the 
structure  of  these  equilibria  as  the  society  becomes  large  (i.e.,  for  a  sequence  of  games). 
Our  main  focus  is  on  how  well  information  is  aggregated,  which  we  capture  with  the  notion 
of  e,  S- asymptotic  learning.  We  say  that  there  is  e,  5- asymptotic  learning  if  more  than 
(1  —  e)- fraction  of  the  agents  take  an  action  that  is  e-close  to  optimal  with  probability  at 
least  1-5,  as  the  society  becomes  large.  Furthermore,  we  say  that  perfect  asymptotic 
learning  occurs,  when  the  fraction  of  agents  that  take  a  nearly  optimal  action  converges  to 
one  in  probability  as  the  society  grows. 

Our  analysis  proceeds  in  several  stages.  First,  we  assume  that  agents  are  non-strategic 
in  their  communication.  Under  these  assumptions,  we  provide  a  necessary  and  a  sufficient 
condition  for  e,  d- asymptotic  learning  under  a  given  equilibrium  strategy  profile.  Intuitively, 
these  conditions  require  that  most  of  the  agents  have  close  access  to  a  sufficient  amount  of 
information.  We  also  show  a  sufficient  and  (under  a  mild  condition)  necessary  condition 
for  perfect  asymptotic  learning,  that  holds  for  all  equilibria  and  it  only  involves  the  net¬ 
work  topology.  The  learning  results  are  further  illustrated  by  a  series  of  corollaries  that 
identify  two  types  of  information  hubs  as  conduits  of  asymptotic  learning.  The  first  are 
information  mavens,  which  have  a  large  in-degree,  enabling  them  to  aggregate  information. 
If  most  agents  are  close  to  an  information  maven,  asymptotic  learning  is  guaranteed.  The 
second  type  of  hubs  are  social  connectors,  which  have  large  out-degree,  enabling  them  to 
communicate  their  information  to  a  large  number  of  agents.4  Social  connectors  are  only 

4both  of  these  terms  are  inspired  by  [36]. 
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useful  for  asymptotic  learning  if  they  are  close  to  mavens,  so  that  they  can  distribute  their 
information.  Thus,  asymptotic  learning  is  also  obtained  if  most  agents  are  close  to  a  social 
connector,  who  is  in  turn  a  short  distance  away  from  a  maven. 

Second,  we  generalize  these  results  to  an  environment,  in  which  individuals  may  mis- 
report  their  information  if  they  have  an  incentive  to  do  so.  In  particular,  we  show  that 
individuals  may  in  general  choose  to  misreport  their  information  in  order  to  delay  the  action 
of  their  neighbors,  thus  obtaining  more  information  from  them  in  the  future.  Nevertheless, 
we  establish  that  whenever  truthful  communication  leads  to  perfect  asymptotic  learning, 
it  is  an  equilibrium  of  the  strategic  communication  game  to  report  truthfully.  Interest¬ 
ingly,  the  converse  is  not  necessarily  true:  strategic  communication  may  lead  to  asymptotic 
learning  in  some  special  cases  in  which  truthful  communication  precludes  learning. 

Our  characterization  results  on  asymptotic  learning  can  be  seen  both  as  “positive” 
and  “negative” .  On  the  one  hand,  communication  structures  that  do  not  feature  such  hubs 
appear  more  realistic  in  the  context  of  social  networks  and  communication  between  friends, 
neighbors  and  co-workers.  Indeed,  the  popular  (though  not  always  empirically  plausible) 
random  graph  models  such  as  preferential  attachment  and  Poisson  (Erdos- Renyi)  graphs  do 
not  lead  to  asymptotic  learning.  On  the  other  hand,  as  discussed  above,  most  individuals 
obtain  key  information  from  either  individuals  or  news  sources  (websites)  that  correspond  to 
mavens  and  social  connectors,  which  do  play  the  role  of  aggregating  and  distributing  large 
amounts  of  information.  Corresponding  to  such  structures,  we  show  that  scale  free  random 
graphs  (in  particular,  power  law  graphs  with  small  exponent  7  <  2), 5  and  hierarchical 
graphs,  where  “special”  agents  are  likely  to  receive  and  distribute  information  to  lower 
layers  of  the  hierarchy,  induce  network  structures  that  guarantee  asymptotic  learning.  The 
intuition  for  why  such  information  hubs  and  almost  all  agents  being  close  to  information 
hubs  are  necessary  for  asymptotic  learning  is  instructive:  were  it  not  so,  a  large  fraction 

5  These  models  are  shown  to  provide  good  representations  for  peer-to-peer  networks,  scientific  collabo¬ 
ration  networks  (in  experimental  physics),  and  traffic  in  networks  ([50],  [57],  [69]  and  [70].) 
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of  agents  would  prefer  to  take  an  action  before  waiting  for  sufficient  information  to  arrive 
and  a  nontrivial  fraction  of  those  would  take  a  suboptimal  action. 

Furthermore,  we  discuss  the  welfare  implications  of  equilibrium  behavior.  In  particu¬ 
lar,  we  compare  the  expected  aggregate  welfare  at  equilibrium  with  that  at  the  optimal 
allocation ,  which  is  defined  as  the  timing  profile  that  a  social  planner  would  choose,  so  as 
to  maximize  it.  We  show  that  when  perfect  asymptotic  learning  occurs,  all  equilibria  are 
efficient.  Moreover,  a  partial  converse  holds:  if  e,  d-asymptotic  learning  fails  at  equilibrium 
and  an  additional  condition  holds,  then  the  equilibrium  is  inefficient.  Our  results  regarding 
welfare  uncover  an  interesting  feature  of  communication  in  social  networks:  information 
exchange  is  inefficient,  because  agents  do  not  internalize  the  positive  externality  that  their 
presence  exerts  on  their  peers. 

In  the  third  part  of  the  thesis,  we  drop  the  assumption  that  the  social  network  is  given 
and  we  let  agents  form  communication  links ,  which  may  be  costly,  thus  combining  social 
learning  with  strategic  network  formation.  The  network  formation  decisions  of  agents 
induce  a  communication  graph  for  the  society,  over  which  communication  takes  place  in 
the  way  described  above.  Armed  with  the  analysis  of  information  exchange  over  a  given 
communication  network,  we  analyze  the  equilibria  of  the  two-stage  network  formation  and 
information  exchange  game,  under  the  assumption  that  although  forming  communication 
links  is  costly,  there  also  exist  social  cliques ,  groups  of  individuals  that  are  linked  to  each 
other  at  zero  cost.  These  can  be  thought  of  as  “friendship  networks,”  which  are  linked  for 
reasons  unrelated  to  information  exchange  and  thus  act  as  conduits  of  such  exchange  at 
low  cost.  Agents  have  to  pay  a  cost  at  the  beginning  in  order  to  communicate  (receive 
information)  from  those  who  are  not  in  their  social  clique. 

Even  though  network  formation  games  have  several  equilibria,  the  structure  of  our 
network  formation  and  information  exchange  game  enables  us  to  obtain  relatively  sharp 
results  on  what  types  of  societies  will  lead  to  endogenous  communication  networks  that 
ensure  asymptotic  learning.  In  particular,  we  show  that  societies  with  too  many  (disjoint) 
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and  sufficiently  large  social  cliques  induce  behavior  inconsistent  with  asymptotic  learning. 
This  is  because  each  social  clique,  which  is  sufficiently  large,  would  have  enough  information 
to  make  communication  with  others  (from  other  social  cliques)  unattractive;  the  society 
gets  segregated  into  a  very  large  number  of  disjoint  social  cliques  not  sharing  information. 
In  contrast,  asymptotic  learning  obtains  in  equilibrium  if  social  cliques  are  neither  too 
numerous  nor  too  large  so  that  it  becomes  advantageous  at  least  for  some  members  of 
these  cliques  to  communicate  with  members  of  other  cliques,  forming  a  structure  in  which 
information  is  shared  across  (almost)  all  members  of  the  society. 

Our  work  is  related  to  several  strands  of  literature  on  social  and  economic  networks. 
First,  it  is  related  to  the  large  and  growing  literature  on  social  learning.  Much  of  this 
literature  focuses  on  Bayesian  models  of  observational  learning,  where  each  individual  learns 
from  the  actions  of  others  taken  in  the  past.  A  key  impediment  to  information  aggregation 
in  these  models  is  the  fact  that  actions  do  not  reflect  all  of  the  information  that  an  individual 
has  and  this  can  induce  a  pattern  reminiscent  to  a  “herd,”  where  individuals  ignore  their 
own  information  and  copy  the  behavior  of  others  (see,  for  example,  [12],  [15]  and  [65],  as 
well  as  [10],  for  early  contributions,  and  [3],  [13]  and  [66]  for  models  of  Bayesian  learning 
with  richer  observational  structures).  While  observational  learning  is  important  in  many 
situations,  a  large  part  of  information  exchange  in  practice  is  through  communication. 

Several  papers  in  the  literature  study  communication,  though  typically  using  non- 
Bayesian  or  “myopic”  rules  (for  example,  [23]  and  [37]).  A  major  difficulty  faced  by  these 
approaches,  often  precluding  Bayesian  and  dynamic  game  theoretic  analysis  of  learning  in 
communication  networks,  is  the  complexity  of  updating  when  individuals  share  their  ex-post 
beliefs.  We  overcome  this  difficulty  by  adopting  a  different  approach,  whereby  individuals 
can  directly  communicate  their  signals  and  there  is  no  restriction  on  the  total  “bits”  of 
communication.  This  leads  to  a  very  tractable  structure  for  updating  of  beliefs  and  enables 
us  to  study  perfect  Bayesian  equilibria  of  a  game  of  network  formation,  communication 
and  decision-making.  It  also  reverses  one  of  the  main  insights  of  these  papers,  also  shared 
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by  the  pioneering  contribution  to  the  social  learning  literature  by  [10],  that  the  presence 
of  “highly  connected”  or  “influential”  agents,  or  what  [10]  call  a  “royal  family,”  acts  as  a 
significant  impediment  to  the  efficient  aggregation  of  information.  On  the  contrary,  in  our 
model  the  existence  of  such  highly  connected  agents  (information  hubs,  mavens  or  connec¬ 
tors)  is  crucial  for  the  efficient  aggregation  of  information.  Moreover,  their  existence  also 
reduces  incentives  for  noil-truthful  communication,  and  is  the  key  input  into  our  result  that 
truthful  communication  can  be  an  e-equilibrium. 

Our  analysis  of  asymptotic  learning  in  large  networks  also  builds  on  random  graph 
models.  In  particular,  we  use  several  tools  and  results  from  this  literature  to  characterize 
the  asymptotics  of  beliefs  and  information.  We  also  study  information  aggregation  in  the 
popular  preferential  attachment  and  Erdos-Renyi  graphs  (e.g.,  [5],  [14],  [25]  and  [55]). 

Our  work  is  also  related  to  the  growing  literature  on  network  formation,  since  com¬ 
munication  takes  place  over  endogenously  formed  networks  (Chapter  8).  Although  the 
network  formation  literature  is  large  and  growing  (see,  e.g.,  [11],  [47]  and  [49]),  we  are  not 
aware  of  other  papers  that  endogenize  the  benefits  of  forming  links  through  the  subsequent 
information  exchange.  It  is  also  noteworthy  that,  while  network  formation  games  have  a 
large  number  of  equilibria,  the  simple  structure  of  our  model  enables  us  to  derive  relatively 
sharp  results  about  environments  in  which  the  equilibrium  networks  will  lead  to  asymptotic 
learning. 

Finally,  our  model  is  related  to  the  literature  on  strategic  communication ,  pioneered 
by  the  cheap  talk  framework  of  [21],  While  cheap  talk  models  have  been  used  for  the 
study  of  information  aggregation  with  one  receiver  and  multiple  senders  (e.g.  [56])  and 
multiple  receivers  and  single  sender  (e.g.  [27]),  most  relevant  to  our  model  are  two  recent 
papers  that  consider  strategic  communication  over  general  networks,  [32]  and  [41],  A  major 
difference  between  these  works  and  ours  is  that  we  consider  a  model  where  communication  is 
allowed  for  more  than  one  time  period,  thus  enabling  agents  to  receive  information  outside 
their  immediate  neighborhood  (at  the  cost  of  a  delayed  decision)  and  we  also  endogenize 
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the  network  over  which  communication  takes  place.  On  the  other  hand,  our  framework 
assumes  that  an  agent’s  action  does  not  directly  influence  others’  payoffs,  while  such  payoff 
interactions  are  the  central  focus  of  [32]  and  [41]. 

1.3  Organization 

The  thesis  is  organized  as  follows.  In  Chapter  2,  we  develop  a  model  of  experimentation 
and  innovation  in  a  competitive  multi-firm  environment.  Moreover,  we  provide  an  explicit 
characterization  of  both  the  asymmetric  and  symmetric  equilibria,  when  firms  are  symmet¬ 
ric,  i.e. ,  they  receive  information  of  equal  precision.  Chapter  3  characterizes  the  optimal 
allocation  and  shows  that  the  efficiency  gap  between  the  symmetric  equilibrium  and  the 
optimal  allocation  can  be  arbitrarily  large.  The  analysis  in  this  chapter  also  demonstrates 
that  an  appropriately- designed  patent  system  can  implement  the  optimal  allocation  (in  all 
equilibria).  Chapter  4  extends  the  model  to  an  environment,  where  firms  have  different 
signal  qualities.  In  Chapter  5,  we  introduce  a  model  of  information  exchange  among  ratio¬ 
nal  agents,  that  are  embedded  in  a  social  network.  Chapter  6  discusses  conditions  under 
which  agents  learn  at  equilibrium  and  also  explores  the  welfare  properties  of  equilibria.  In 
Chapter  7,  we  apply  our  learning  results  to  several  models  of  random  graphs.  Chapter  8 
extends  the  information  exchange  model  by  incorporating  endogenous  network  formation. 
Finally,  Chapter  9  concludes  the  thesis  and  discusses  a  number  of  promising  avenues  for 
future  research.  Appendices  A-F  contain  additional  results  and  some  of  the  proofs  omitted 
from  the  main  text. 
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Chapter  2 


A  Model  of  Strategic 
Experimentation 


The  economy  in  the  baseline  model  consists  of  two  research  firms,  each  maximizing  the 
present  discounted  value  of  profits.  Time  is  continuous,1  and  both  firms  discount  the 
future  at  a  common  rate  r  >  0. 

Each  firm  can  implement  (“experiment  with”)  a  distinct  project.  The  success  proba¬ 
bility  of  experimentation  is  p  >  0.  The  success  or  failure  of  experimentation  by  a  firm  is 
publicly  observed.  When  experimentation  is  successful,  we  refer  to  this  as  an  “innovation” . 

At  time  t,  a  firm  can  choose  one  of  three  possible  actions:  (1)  experiment  with  a  project 
(in  particular,  with  the  project  on  which  the  firm  has  received  a  positive  signal);  (2)  copy 
a  successful  project:  (3)  wait.  Experimentation  and  copying  are  irreversible,  so  that  a  firm 
cannot  then  switch  to  implement  a  different  project.  In  the  context  of  research,  this  captures 
the  fact  that  commitment  of  intellectual  and  financial  resources  to  a  specific  research  line 
or  project  is  necessary  for  success.  Copying  of  a  successful  project  can  be  interpreted  more 
broadly  as  using  the  information  revealed  by  successful  innovation  or  experimentation,  so 

1In  the  Appendix,  we  present  a  discrete- time  version  of  the  model  and  formally  show  that  the  continuous¬ 
time  version  studied  in  the  main  text  gives  the  same  economic  and  mathematical  answers  as  the  limit  of 
the  discrete-time  model,  when  the  time  interval  A  — >  0. 
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it  does  not  need  to  correspond  to  the  second  firm  replicating  the  exact  same  innovation  (or 
product).2 

Payoffs  depend  on  the  success  of  the  project  and  whether  the  project  is  copied.  During 
an  interval  of  length  r ,  the  payoff  to  a  firm  that  is  the  only  one  implementing  a  successful 
project  is  7r \T  >  0.  In  contrast,  if  a  successful  project  is  implemented  by  both  firms,  each 
receives  ir 2r  >  0.3  The  payoff  to  an  unsuccessful  project  is  normalized  to  zero. 

Until  we  introduce  heterogeneity  in  success  probabilities,  we  maintain  the  following 
assumption.4 

Assumption  1. 

7Ti  >  7T2  >  p7T  i . 


Let  us  also  define  the  present  discounted  value  of  profits  as 


Hj  =  -jf  for  j  =  1,  2, 


and  for  future  reference,  define 


,_n2 

^TV 


(2.1) 


Clearly,  /3  G  (p,  1)  in  view  of  Assumption  1.  Assumption  1  implies  that  the  payoff  from  a 
new  innovation  is  decreasing  in  the  number  of  firms  that  adopt  it  (tti  >  tx2)  and  also  that 
the  expected  payoff  of  a  firm’s  experimentation  is  smaller  than  the  payoff  from  copying  a 
successful  innovation.  In  particular,  the  firm  prefers  to  copy  than  to  experiment  with  its 
own  research  opportunity. 


2  Several  generalizations  do  not  affect  our  qualitative  results,  and  are  not  introduced  to  reduce  notation 
and  maximize  transparency.  These  include:  (1)  copying  can  be  successful  with  some  probability  v  G  (p,  1]; 
(2)  copying  after  an  unsuccessful  experimentation  is  feasible,  but  involves  a  cost  Ih  >  0;  (3)  experimentation 
itself  involves  a  cost  Id  >  0. 

3It  will  be  evident  from  the  analysis  below  that  all  of  our  results  can  be  straightforwardly  generalized 
to  the  case  where  an  innovator  receives  payoff  Trf^V  when  copied,  whereas  the  copier  receives  7r|econdr. 
Since  this  has  no  major  effect  on  the  main  economic  insights  and  just  adds  notation,  we  do  not  pursue  this 
generalization. 

4 The  structure  of  equilibria  without  this  assumption  is  trivial  as  our  analysis  in  Section  3  shows. 
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Now  we  are  in  a  position  to  define  strategies  in  this  game.  Let  a  history  up  to  time  t 
be  denoted  by  hK  The  set  of  histories  is  denoted  by  Vf.  A  strategy  for  a  firm  is  a  mapping 
from  time  t  and  the  history  of  the  game  up  to  t.  hl,  to  the  flow  rate  of  experimentation  at 
time  t  and  the  distribution  over  projects.  Thus,  the  time  t  strategy  can  be  written  as 

u:M+  x^->R+  x  A  ({1,2}), 

where  E+  =  R+  U  {+00}  and  A  ({1,2})  denotes  the  set  of  probability  distributions  over 
the  set  of  projects  (project  available  to  the  first,  second  firm  is  labeled  1,  2  respectively), 
corresponding  to  the  choice  of  project  when  the  firm  implements  one.  The  latter  piece  of 
generality  is  largely  unnecessary  (and  will  be  omitted),  since  there  will  never  be  mixing 
over  projects  (a  firm  will  either  copy  a  successful  project  or  experiment  with  the  project 
for  which  it  has  received  a  positive  signal).  Here  a  (t,  hv)  =  (0,  ■)  corresponds  to  waiting  at 
time  t  and  a  (t,  hl)  —  (00,  j)  corresponds  to  implementing  project  j  at  time  t,  which  could 
be  experimentation  or  copying  of  a  successful  project.  Let  us  also  denote  the  strategy  of 
firm  2  =  1,2  by  =  {aflt,  .)}^0. 

History  up  to  time  t  can  be  summarized  by  two  events  a*  £  {0, 1}  denoting  whether 
the  other  firm  has  experimented  up  to  time  t  and  sf  £  {0, 1}  denoting  whether  this  choice 
was  successful.  With  a  slight  abuse  of  notation  we  will  use  both  a  (t,  hl)  and  a  ( t , ,  a\  A)  to 
denote  time  t  strategies.  We  study  subgame  perfect  equilibria  in  the  environment  defined 
above.  In  particular,  a  subgame  perfect  equilibrium  (or  simply  equilibrium)  is  a  strategy 
profile  (<7i,d2)  such  that  (ai\hk ,  a2\hk)  is  a  Nash  equilibrium  of  the  subgame  defined  by 
history  hf  for  all  histories  hl  £  Tfl,  where  at\hk  denotes  the  restriction  of  a  {  to  the  histories 
consistent  with  hk. 

2.1  Asymmetric  Equilibria 

Even  though  firms  are  symmetric  (in  terms  of  their  payoffs  and  information),  there  can 
be  symmetric  and  asymmetric  equilibria.  Our  main  interest  is  with  symmetric  equilibria, 
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where  strategies  are  independent  of  the  identity  of  the  player.  Nevertheless,  it  is  convenient 
to  start  with  asymmetric  equilibria.  These  equilibria  are  somewhat  less  natural,  because, 
as  we  will  see,  they  involve  one  of  the  players  never  experimenting  until  the  other  one  does. 
Before  describing  the  equilibria,  we  introduce  some  additional  notation.  In  particular, 
the  flow  rate  of  experimentation  a  induces  a  stochastic  distribution  of  “stopping  time,” 
which  we  denote  by  r.  The  stopping  time  r  designates  the  probability  distribution  that 
experimentation  will  happen  at  any  time  t  E  R+  conditional  on  the  other  player  not  having 
experimented  until  then.  A  pure  strategy  simply  specifies  r  E  M+.  For  example,  the 
strategy  of  experimenting  immediately  is  r  =  0,  whereas  that  of  waiting  for  the  other 
firm’s  experimentation  is  represented  by  r  =  +oo.  The  r  notation  is  convenient  to  use  for 
the  next  two  propositions,  while  in  characterizing  the  structure  of  equilibria  we  need  to  use 
a  (thus  justifying  the  introduction  of  both  notations).5 

In  an  asymmetric  equilibrium,  one  of  the  firms,  say  1,  experiments  immediately  with 
its  research  project.  Firm  2  copies  firm  I  immediately  afterwards  if  the  latter  is  successful 
and  tries  its  own  project  otherwise.  Throughout  the  paper,  when  there  are  two  firms,  we 
use  the  notation  ~  i  to  denote  the  firm  i!  ^  i. 

Proposition  1.  Suppose  that  Assumption  1  holds.  Then  there  exist  two  asymmetric  equi¬ 
libria.  In  each  equilibrium,  t,  =  0  and  r ^  =  +oo  for  i  =  1,2. 

The  proof  of  the  proposition  is  straightforward  and  can  be  therefore  omitted.  Note 
that  the  equilibria  described  above  are  not  the  only  asymmetric  equilibria  in  this  environ¬ 
ment.  Another  set  of  such  equilibria  involves  one  of  the  firms  experimenting  with  positive 
probability  (not  going  to  zero)  at  time  t  =  0  and  both  firms  using  a  constant  flow  of  experi¬ 
mentation  from  then  on.  The  crucial  feature  of  asymmetric  equilibria  is  that  they  explicitly 

JHere  we  could  follow  the  more  precise  approach  in  [63]  for  modeling  strategies  in  continuous-time 
games  with  jumps.  This  amounts  to  defining  an  extended  strategy  space,  where  stopping  times  are  defined 
for  all  t  €  R+  and  also  for  t- f  for  any  t  €  R+.  In  other  words,  strategies  will  be  piecewise  continuous 
and  right  continuous  functions  of  time,  so  that  a  jump  immediately  following  some  time  t  6  R+  is  well 
defined.  Throughout,  we  do  allow  such  jumps,  but  do  not  introduce  the  additional  notation,  since  this  is 
not  necessary  for  any  of  the  main  economic  insights  or  proofs. 
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condition  on  the  identity  of  the  firms. 


2.2  Symmetric  Equilibria 

As  mentioned  above,  asymmetric  equilibria  explicitly  condition  on  the  identity  of  the 
firm:  one  of  the  firms,  with  label  |  is  treated  differently  than  the  firm  with  label  ~  i.  This 
has  important  payoff  consequences.  In  particular,  it  can  be  verified  easily  that  firm  ~  i 
has  strictly  greater  payoffs  in  the  equilibrium  of  Proposition  1  than  firm  i.  In  addition, 
as  already  noted  in  the  previous  section,  asymmetric  equilibria  rely  on  some  degree  of 
coordination  between  the  firms,  e.g.,  one  of  them  will  not  experiment  until  the  other  one 
does.  In  this  light,  symmetric  equilibria,  where  strategies  are  not  conditioned  on  firms’ 
“labels,”  and  firms  obtain  the  same  equilibrium  payoffs  in  expectation  are  more  natural. 
In  this  section,  we  study  such  symmetric  equilibria. 

As  defined  above  a  firm’s  strategy  is  a  mapping  from  time  and  the  firm’s  information 
set  to  the  flow  rate  of  experimentation  with  a  project.  We  refer  to  a  strategy  as  pure  if  the 
flow  rate  of  experimentation  at  a  given  time  t  is  either  0  or  oo.  Our  first  result  shows  that 
there  are  no  pure-strategy  symmetric  equilibria. 


Proposition  2.  Suppose  that  Assumption  1  holds.  Then  there  exist  no  symmetric  pure- 
strategy  equilibria. 

Proof.  Suppose,  to  obtain  a  contradiction,  that  a  symmetric  pure-strategy  equilibrium 
exists.  Then  r*  =  t  G  M+  for  i  =  1,  2,  yielding  payoff 

V(t*,t*)  =  e-rtplh 

to  both  players.  Now  consider  a  deviation  r'  >  t  for  one  of  the  firms,  which  involves  waiting 
for  a  time  interval  e  and  copying  a  successful  innovation  if  there  is  such  an  innovation  during 
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this  time  interval.  As  e  — >•  0,  this  strategy  gives  the  deviating  firm  payoff  equal  to 


V  (r,  r*)  =  lime  r(t+e)  [pU2  +  (1  -  p)pUi] . 


Assumption  1  implies  that  V  (t',t*)>V  ( t*,t *),  establishing  the  result.  ■ 

Proposition  2  is  intuitive.  Asymmetric  equilibria  involve  one  of  the  firms  always  waiting 
for  the  other  one  to  experiment  and  receiving  higher  payoff.  Intuitively,  Proposition  2 
implies  that  in  symmetric  equilibria  both  firms  would  like  to  be  in  the  position  of  the  firm 
receiving  higher  payoffs  and  thus  delaying  their  own  experimentation  in  order  to  benefit 
from  that  of  the  other  firm.  These  incentives  imply  that  no  (symmetric)  equilibrium  can 
have  immediate  experimentation  with  probability  1  by  either  firm. 

Proposition  2  also  implies  that  all  symmetric  equilibria  must  involve  mixed  strategies. 
Moreover,  any  candidate  equilibrium  strategy  must  involve  copying  of  a  successful  project 
in  view  of  Assumption  1  and  immediate  experimentation  when  the  other  firm  has  experi¬ 
mented.  Therefore,  we  can  restrict  attention  to  time  t  strategies  of  the  form 


(Tcdy)  = 


(1,  ~  i)  if  =  1  and  s*  =  1, 

(1,  i)  if  at  =  1  and  st  =  0, 

(A (t),  i )  if  ah  =  0, 


(2.2) 


where  A  :  R+  — >  E+  designates  the  flow  rate  of  experimentation  at  time  t  conditional  on 
no  experimentation  by  either  firm  up  to  time  t.  Given  this  observation,  from  now  on  we 
will  work  directly  with  A(t). 

Next  we  derive  an  explicit  characterization  of  the  (unique)  symmetric  equilibrium.  The 
next  lemma  (proof  in  the  appendix)  shows  that  symmetric  equilibria  must  involve  mixing 
on  all  tel+  and  will  be  used  in  the  characterization  of  mixed-strategy  equilibria. 

Lemma  1.  The  support  of  mixed  strategy  equilibria  is  R+. 

Lemma  1  implies  that  in  all  symmetric  equilibria  there  will  be  mixing  at  all  times  (until 
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there  is  experimentation).  Using  this  observation,  Proposition  3  characterizes  the  unique 
symmetric  equilibrium.  Let  us  illustrate  the  reasoning  here  by  assuming  that  firms  use  a 
constant  flow  rate  of  experimentation  (the  proof  of  Proposition  3  relaxes  this  assumption). 
In  particular,  suppose  that  firm  ~  i  innovates  at  the  flow  rate  A  for  all  t  E  K+.  Then  the 
value  of  innovating  at  time  t  (i.e. ,  choosing  r  =  t)  for  firm  i  is 

V  (t)  =  f  \e~Xze~rz  [pll2  +  (1  -  p)  pill]  dz  +  e~xte~rtpJl2-  (2.3) 

Jo 

This  expression  uses  the  fact  that  when  firm  ~  i  is  experimenting  at  the  flow  rate  A,  the 
timing  of  its  experimentation  has  an  exponential  distribution,  with  density  Ae-At.  Then 
the  first  term  in  (2.3)  is  the  expected  discounted  value  from  the  experimentation  of  firm  ~  i 
between  0  and  t  (again  taking  into  account  that  following  an  experimentation,  a  successful 
innovation  will  be  copied,  with  continuation  value  pll2  +  (1  -p)pU1).  The  second  term 
is  the  probability  that  firm  ~  %  does  not  experiment  until  t,  which,  given  the  exponential 
distribution,  is  equal  to  e~Xt,  multiplied  by  the  expected  discounted  value  to  firm  i  when 
it  is  the  first  to  experiment  at  time  t  (given  by  e~rtpU.2)- 

Lemma  1  implies  that  V  ( t )  must  be  constant  in  t  for  all  t  eR+.  Therefore,  its  derivative 
V'  it)  must  be  equal  to  zero  for  all  t  in  any  symmetric  equilibrium  implying  that 

V'  (t)  —  \e~xte~rt  [PU2  +  (1  -  p)  pIT]  —  (r  +  A)  e~Xte~rtpU2  =  0,  (2.4) 

for  all  t.  This  equation  has  a  unique  solution: 

A*  =  . for  all  t.  (2.5) 

1  -p 

The  next  proposition  shows  that  this  result  also  holds  when  both  firms  can  use  time- varying 
experimentation  rates. 

Proposition  3.  Suppose  that  Assumption  1  holds.  Then  there  exists  a  unique  symmetric 
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equilibrium.  This  equilibrium  involves  both  firms  using  a  constant  flow  rate  of  experimen¬ 
tation  A*  as  given  by  (2.5).  Firm  i  immediately  copies  a  successful  innovation  by  firm  ~  i 
and  experiments  if  firm  ~  i  experiments  unsuccessfully. 


Proof.  Suppose  that  firm  ~  i  experiments  at  the  flow  rate  A  (/,)  at  time  f  6l+,  Let  us 
define 


m(t) 


(2.6) 


Then  the  equivalent  of  (2.3)  is 

V(t)=  f  A  (z)  e~m^e-rz  [PU2  +  (1  -  p)p n2]  dz  +  e^e~rtpU2.  (2.7) 

Jo 


Here  ffi)  A (z)e~m(z)dz  is  the  probability  that  firm  ~  i  (using  strategy  A)  will  experiment 
between  times  tx  and  t2,  and  e~mM  =  1  -  f*  A  (z)  e~m^dz  is  the  probability  that  ~  i 
has  not  experimented  before  time  t.  Thus  the  first  term  is  the  expected  discounted  value 
from  the  experimentation  of  firm  —  i-  between  0  and  t  (discounted  and  multiplied  by  the 
probability  of  this  event).  The  second  term  is  again  the  probability  that  firm  ~  i  does  not 
experiment  until  t  multiplied  by  the  expected  discounted  value  to  firm  i  when  it  is  the  first 
to  experiment  at  time  t  (given  by  e~rtpH2). 

Lemma  1  implies  that  V  (t)  must  be  constant  in  t  for  all  t  e  R+.  Since  V  (t)  is 
differentiable  in  t,  this  implies  that  its  derivative  V  ( t )  must  be  equal  to  zero  for  all  t. 
Therefore, 


V'(t)  =  X(t)e-m^e-rt[pH2  +  (l-p)pU1]-(r  +  m,(t))e-m^e-rtpU2 
=  0  for  all  t. 


Moreover,  note  that  m(t)  is  differentiable  and  rri(t)  =  A (t).  Therefore,  this  equation  is 
equivalent  to 

A  (t)  \pU2  +  (1  —p)  =  (r  +  A(i))j>n2  for  all  t.  (2.8) 
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The  unique  solution  to  (2.8)  is  (2.5),  establishing  the  uniqueness  of  the  symmetric  equilib¬ 
rium  without  restricting  strategies  to  constant  flow  rates.  ■ 

We  end  this  section  by  discussing  how  the  equilibrium  flow  rate  of  experimentation 
A*,  given  by  equation  (2.5),  is  affected  by  the  relevant  parameters.  In  particular,  consider 
increasing  (the  inequality  n2  >  p  ■  tci  would  clearly  continue  to  hold).  This  increases 
the  value  of  waiting  for  a  firm  and  leaves  the  value  of  experimenting  unchanged,  so  the 
equilibrium  flow  rate  of  experimentation  declines,  i.e. ,  increasing  7r i  reduces  fi  and  A  . 

2.3  Multiple  Firms 

Let  us  now  suppose  that  there  are  N  firms,  each  of  which  receives  a  positive  signal 
about  one  of  the  projects.  The  probability  that  the  project  that  has  received  a  positive 
signal  will  succeed  is  still  p  and  each  firm  receives  a  signal  about  a  different  project.  Let 
7rn  denote  the  flow  payoff  from  a  project  that  is  implemented  by  n  other  firms  and  define 


Once  again,  /?  =  112/11!  as  specified  in  (2.1)  and  Assumption  1  holds,  so  that  (3  >  p. 

The  following  proposition  is  established  using  similar  arguments  to  those  in  the  previous 
two  sections  and  its  proof  is  omitted. 

Proposition  4.  Suppose  that  Assumption  1  holds  and  that  there  are  N  >  2  firms.  Then 
there  exist  no  symmetric  pure-strategy  equilibria.  Moreover  the  support  of  the  mixed-strategy 
equilibria  is  R+. 

It  is  also  straightforward  to  show  that  there  exist  asymmetric  pure-strategy  equilibria. 
For  example,  when  II /nx  >  p,  it  is  an  equilibrium  for  firm  1  to  experiment  and  the  re¬ 
maining  iV  —  1  to  copy  if  this  firm  is  successful.  If  it  is  unsuccessful,  then  firm  2  experiments 
and  so  on. 

As  in  the  previous  two  sections,  symmetric  equilibria  are  of  greater  interest.  To  char- 
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acterize  the  structure  of  symmetric  equilibria,  let  us  first  suppose  that 

n„  =  n2  for  all  n  >  2  (2.9) 

and  also  to  simplify  the  discussion,  focus  on  symmetric  equilibria  with  constant  flow  rates. 
In  particular,  let  the  rate  of  experimentation  when  there  are  n  >  2  firms  be  An.  Consider  a 
subgame  starting  at  time  f0  with  n  firms  that  have  not  yet  experimented  (and  all  previous, 
N  -  n,  experiments  have  been  unsuccessful).  Then  the  continuation  value  of  firm  i  (from 
time  t0  onwards)  when  it  chooses  to  experiment  with  probability  1  at  time  t0  +  t  is 

Vn  ( t )  =  /  A„  (n  -  1)  e-A»(n-D(*-to)e-r(z-to)  [pn2  +  (1  -  p)  Vn_x]  dz  +  e-^n~^  e~rtpU2, 
Jto 

(2.10) 

where  un_i  is  the  maximum  value  that  the  firm  can  obtain  when  there  are  n  —  1  firms 
that  have  not  yet  experimented  (where  we  again  use  v  since  this  expression  refers  to 
the  continuation  value  from  time  f0  onwards).  Intuitively,  An  (n  —  1)  e~xn(n-i)(z-t0)  js  ^ie 
density  at  which  one  of  the  n  —  1  other  firms  mixing  at  the  rate  An  will  experiment  at 
time  z  G  (t0,t0  +  t).  When  this  happens,  it  is  successful  with  probability  p  and  will 
be  copied  by  all  other  firms,  and  each  will  receive  a  value  of  e~rC-*o)]q2  (discounted 
back  to  to)-  If  it  is  is  unsuccessful  (probability  1  —  p),  the  number  of  remaining  firms 
is  n  —  1|  and  this  gives  a  value  of  un_i.  If  no  firm  experiments  until  time  t,  firm  i 
chooses  to  experiment  at  this  point  and  receives  e~rtpU2-  The  probability  of  this  event 
is  1  —  f*°+t  Xn  ( n  —  1)  e~Xn<'n~1',('z~t°')dz  =  e~xn(n-i)t_  usuai;  jn  a  mjxecj  strategy  equilib¬ 
rium,  vn  ( t )  needs  to  be  independent  of  t  and  moreover,  it  is  clearly  differentiable  in  t.  So 
its  derivative  must  be  equal  to  zero.  This  implies 

An  (n  -  1)  bn 2  +  (1  -  P)  vn- i]  =  (An  (n  -  1)  +  r)  pU2.  (2.11) 
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Proposition  4  implies  that  there  has  to  be  mixing  in  all  histories,  thus 


vn  =  pIT  for  all  n  >  2. 


(2.12) 


Intuitively,  mixing  implies  that  the  firm  is  indifferent  between  experimentation  and  wait¬ 
ing,  and  thus  its  continuation  payoff  must  be  the  same  as  the  payoff  from  experimenting 
immediately,  which  is  pn2.  Combining  (2.11)  and  (2.12)  yields 


An 


rn2 

(1  —  p)  (n  —  1)  Hi 


(2.13) 


This  derivation  implies  that  in  the  economy  with  N  firms,  each  firm  starts  mixing  at 
the  flow  rate  A N.  Following  an  unsuccessful  experimentation,  they  increase  their  flow  rate 
of  experimentation  to  \n-i,  and  so  on. 

The  derivation  leading  up  to  (2.13)  easily  generalizes  when  we  relax  (2.9).  To  demon¬ 
strate  this,  let  us  relax  (2.9)  and  instead  strengthen  Assumption  1  to: 


Assumption  2. 

Ifo  >  pill  for  all  n. 

The  value  of  experimenting  at  time  t  (starting  with  n  firms)  is  now  given  by  a  general¬ 
ization  of  (2.10): 

rto+t 

Vn  ( t )  =  A„  (n  -  1)  e— Mn-1)(*— to)e— r(*-io)  ^  +  (1  -  p)  Un_i]  dz  +  e-^(n-l)te-r tpn„. 

J  to 

Again  differentiating  this  expression  with  respect  to  t  and  setting  the  derivative  equal  to 
zero  gives  the  equivalent  indifference  condition  to  (2.11)  as 

A n(n-  1)  [pnn  +  (1  -p)vn- x]  =  (A„  (n  —  l)  +  r)plln.  (2-14) 
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for  n  =  2, N.  In  addition,  we  still  have 


vn  =  pHn  for  all  n  >  2. 

Combining  this  with  (2.14),  we  obtain 

(n  -  1)  An  =  — for  n  =  2, N,  (2.15) 

1  -  p  n„_i 

and  let  us  adopt  the  convention  that  Ai  =  +oo.  Note  that  the  expression  on  the  left  hand 
side  is  the  aggregate  rate  of  experimentation  that  a  firm  is  facing  from  the  remaining  firms. 
This  derivation  establishes  the  following  proposition. 

Proposition  5.  Suppose  that  Assumption  2  holds.  Then  there  exists  a  unique  symmetric 
equilibrium.  In  this  equilibrium,  when  there  are  n  =  1,2,  ...,N  firms  that  have  not  yet 
experimented,  each  experiments  at  the  constant  flow  rate  Xn  as  given  by  (2.15).  A  successful 
innovation  is  immediately  copied  by  all  remaining  firms.  An  unsuccessful  experimentation 
starting  with  n  >  3  firms  is  followed  by  all  remaining  firms  experimenting  at  the  flow  rate 
Xn— 1  - 

An  interesting  feature  of  Proposition  5  is  that  after  an  unsuccessful  experimentation, 
the  probability  of  further  experimentation  may  decline.  Whether  this  is  the  case  or  not 
depends  on  how  fast  IIn  decreases  in  n. 
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Chapter  3 


Patents  and  Optimal  Allocations 

The  analysis  so  far  has  established  that  symmetric  equilibria  involve  mixed  strategies, 
potential  delays,  and  also  staggered  experimentation  (meaning  that  with  probability  1,  one 
of  the  firms  will  experiment  before  others).  Asymmetric  equilibria  avoid  delays,  but  also 
feature  staggered  experimentation.  Moreover,  they  are  less  natural,  because  they  involve 
one  of  the  firms  never  acting  (experimenting)  until  the  other  one  does  and  also  because  they 
give  potentially  very  different  payoffs  to  different  firms.  In  this  section,  we  first  establish 
the  inefficiency  of  (symmetric)  equilibria.  We  then  suggest  that  an  appropriately-designed 
patent  system  can  implement  optimal  allocations.  While  all  of  the  results  in  the  section 
hold  for  IV  >  2  firms,  we  focus  on  the  case  with  two  firms  to  simplify  notation. 

3.1  Welfare 

It  is  straightforward  to  see  that  symmetric  equilibria  are  Pareto  suboptimal.  Suppose 
that  there  exists  a  social  planner  that  can  decide  the  experimentation  time  for  each  firm. 
Suppose  also  that  the  social  planner  would  like  to  maximize  the  sum  of  the  present  dis¬ 
counted  values  of  the  two  firms.  Clearly,  in  practice  an  optimal  allocation  (and  thus  the 
objective  function  of  the  social  planner)  may  also  take  into  account  the  implications  of  these 
innovations  on  consumers.  However,  we  have  not  so  far  specified  how  consumer  welfare  is 
affected  by  the  replication  of  successful  innovations  versus  new  innovations.  Therefore,  in 
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what  follows,  we  focus  on  optimal  allocations  from  the  viewpoint  of  firms.  This  would  also 
be  the  optimal  allocation  taking  consumer  welfare  into  account  when  consumer  surpluses 
from  a  new  innovation  and  from  a  successful  innovation  implemented  by  two  firms  are 
proportional  to  and  2IT2 ,  respectively.  If  we  take  the  differential  consumer  surpluses 
created  by  these  innovations  into  account,  this  would  only  affect  the  thresholds  provided 
below,  and  for  completeness,  we  also  indicate  what  these  alternative  thresholds  would  be. 

The  social  planner  could  adopt  one  of  two  strategies: 

1.  Staggered  experimentation:  this  would  involve  having  one  of  the  firms  experiment  at 
t  —  0;  if  it  is  successful,  then  the  other  firm  would  copy  the  innovation,  and  otherwise 
the  other  firm  would  experiment  immediately.  Denote  the  surplus  generated  by  this 
strategy  by  Sf. 

2.  Simultaneous  experimentation :  this  would  involve  having  both  firms  experiment  im¬ 
mediately  at  t  =  0.  Denote  the  surplus  generated  by  this  strategy  by  Sf  ■ 

It  is  clear  that  no  other  strategy  could  be  optimal  for  the  planner.  Moreover,  Sf  and 
Sf  have  simple  expressions.  In  particular, 

Sf  =  2pTJ2  +  (1  -  p)  pill.  (3.1) 

Intuitively,  one  of  the  firms  experiments  first  and  is  successful  with  probability  p.  When 
this  happens,  the  other  firm  copies  a  successful  innovation,  with  total  payoff  2ir2.  With 
the  complementary  probability,  1  -  p,  the  first  firm  is  unsuccessful,  and  the  second  firm 
experiments  independently,  with  expected  payoff  pfli.  These  payoffs  occur  immediately 
after  the  first  experimentation  and  thus  are  not  discounted. 

The  alternative  is  to  have  both  firms  experiment  immediately,  which  generates  expected 
surplus 

Sf  =  2pIIi.  (3.2) 
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The  comparison  of  S f  and  SE  implies  that  simultaneous  experimentation  by  both  firms 
is  optimal  when  2/3  <  1  +  p.  In  contrast,  when  2/3  >  1  +  p,  the  optimal  allocation  involves 
one  of  the  firms  experimenting  first,  and  the  second  firm  copying  successful  innovations. 
This  is  stated  in  the  next  proposition  (proof  in  the  text). 

Proposition  6.  Suppose  that 

2/3  >  1  +  p,  (3.3) 

then  the  optimal  allocation  involves  staggered  experimentation,  that  is,  experimentation  by 
one  firm  and  copying  of  successful  innovations.  If  (3.3)  does  not  hold,  then  the  optimal 
allocation  involves  immediate  experimentation  by  both  firms.  When  2/3  =  1  +  p,  both 
staggered  experimentation  and  immediate  experimentation  are  socially  optimal. 

Note  at  this  point  that  if  consumer  surpluses  from  a  new  innovation  and  from  the  two 
firms  implementing  the  same  project  were,  respectively,  Cj  and  2C2,  then  we  would  have 

Sfi  =  2 p  (II2  +  C2)  +  (1  ~  p)p  (III  +  Ci) 

S?  =  2p(U1  +  C1). 

Denoting  7  =  (n2  +  C2)  /  (nx  +  Ci),  it  is  then  clear  that  condition  (3.3)  would  be  replaced 
by  2y  >  1  +  p  and  the  rest  of  the  analysis  would  remain  unchanged.  If  C2  =  kH-2  and 
Ci  —  kIIi,  then  this  condition  would  be  identical  to  (3.3). 

Let  us  now  compare  this  to  the  equilibria  characterized  so  far.  Clearly,  asymmetric 
equilibria  are  identical  to  the  first  strategy  of  the  planner  and  thus  generate  surplus  S[ 
(recall  subsection  2.1.).  In  contrast,  the  (unique)  symmetric  equilibrium  generates  social 
surplus 

POO 

SE  =  /  2X*e-{2y+r)t[2pU2  +  (l-p)pU1}dt  (3.4) 

Jo 

2X* 

=  ,  [2pn2  +  (1  - p) piii]  =  2pn2, 

2X  +  r 
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where  A*  is  the  (constant)  equilibrium  flow  rate  of  experimentation  given  by  (2.5).  The 
first  line  of  (3.4)  applies  because  the  time  of  first  experimentation  corresponds  to  the 
first  realization  of  one  of  two  random  variables,  both  with  an  exponential  distribution 
with  parameter  A*  and  time  is  discounted  at  the  rate  r.  If  the  first  experimentation  is 
successful,  which  has  probability  p,  surplus  is  equal  to  2n2,  and  otherwise  (with  probability 
1  —  p),  the  second  firm  also  experiments,  with  expected  payoff  pU.x.  The  second  line  is 
obtained  by  solving  the  integral  and  substituting  for  (3.1).  An  alternative  way  to  obtain 
that  SE  =  2pII2  is  by  noting  that  at  equilibrium  the  two  firms  are  mixing  with  a  constant 
flow  of  experimentation  for  all  times,  thus  the  expected  payoff  for  each  should  be  equal  to 
the  payoff  when  they  experiment  at  time  t  =  0,  i.e. ,  pfl2. 

A  straightforward  comparison  shows  that  SE  is  always  (strictly)  less  than  both  Sf 
and  Sf.  Therefore,  the  unique  symmetric  equilibrium  is  always  inefficient.  Moreover,  this 
inefficiency  can  be  quantified  in  a  simple  manner.  Let  Sp  =  max  {.S'f,  Sf }  and  consider 
the  ratio  of  equilibrium  social  surplus  to  the  social  surplus  in  the  optimal  allocation  as  a 
measure  of  inefficiency: 

_  SE 
s~  Sp ' 

Naturally,  the  lower  is  s  the  more  inefficient  is  the  equilibrium. 

Clearly,  s  <  1,  so  that  the  equilibrium  is  always  inefficient  as  stated  above.  More 
specifically,  let  us  first  suppose  that  (3.3)  holds.  Then,  the  source  of  inefficiency  is  delayed 
experimentation.  In  this  case, 

SE 

5  ~  W 

2A*  2/3 

2A*  +  r  ~  2/3  +  1  -p' 

where  the  last  equality  simply  uses  (2.5).  It  is  clear  that  s  is  minimized,  for  given  p,  as 
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f3  =  (1  +  p)  /2  (its  lower  bound  given  (3.3)).  In  that  case,  we  have 


1  +  p 

5  =  — • 

In  addition,  as  p  ^  0,  s  can  be  as  low  as  1/2. 

Next  consider  the  case  where  (3.3)  does  not  hold.  Then 

SE 

5  ~  Sf 

_  2A*  2pU2  +  (1  ~  p)  plh  _  „ 

2A*  +  r  2pUj  P’ 

where  the  last  equality  again  uses  (2.5)  and  the  definition  of  8  from  (2.1).  Since  this 
expression  applies  when  (3  <  1  +  p,  f8  can  be  arbitrarily  small  as  long  as  p  is  small  (to 
satisfy  the  constraint  that  /3  >  p),  and  thus  in  this  case  s  |  0.  In  both  cases,  the  source  of 
inefficiency  of  the  symmetric  equilibrium  is  because  it  generates  insufficient  incentives  for 
experimentation.  In  the  first  case  this  exhibits  itself  as  delayed  experimentation,  and  in 
the  second,  as  lack  of  experimentation  by  one  of  the  firms. 

This  discussion  establishes  (proof  in  the  text) . 

Proposition  7.  1.  Asymmetric  equilibria  are  Pareto  optimal  and  maximize  social  sur¬ 

plus  when  (3.3)  holds,  but  fail  to  maximize  social  surplus  when  (3.3)  does  not  hold. 

2.  The  unique  symmetric  equilibrium  is  always  Pareto  suboptimal  and  never  maximizes 
social  surplus.  When  (3.3)  holds,  this  equilibrium  involves  delayed  experimentation, 
and  when  (3.3)  does  not  hold,  there  is  insufficient  experimentation. 

3.  When  (3.3)  holds,  the  relative  surplus  in  the  equilibrium  compared  to  the  surplus  in 
the  optimal  allocation,  s,  can  be  as  small  as  1/2.  When  (3.3)  does  not  hold,  the 
symmetric  equilibrium  can  be  arbitrarily  inefficient.  In  particular,  s  4,  0  as  p  ft)  and 

no. 
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It  is  straightforward  to  verify  that  the  results  in  this  proposition  apply  exactly  if  con¬ 
sumer  surpluses  are  proportional  to  firm  profits,  i.e. ,  C2  =  «n2  and  Cx  =  /cnx.  If  this  is  not 
the  case,  then  the  worst-case  scenario  considered  in  part  3  can  become  even  worse  because 
of  the  misalignment  between  firm  profits  and  consumer  surplus  resulting  from  different 
types  of  successful  research  projects. 

3.2  Patents 

The  previous  section  established  the  inefficiency  of  the  symmetric  equilibrium  resulting 
from  delayed  and  insufficient  experimentation.  In  this  section,  we  discuss  how  patents  can 
solve  or  ameliorate  this  problem.  Our  main  argument  is  that  a  patent  system  provides 
incentives  for  greater  experimentation  or  for  experimentation  without  delay. 

We  model  a  simple  patent  system,  whereby  a  patent  is  granted  to  any  firm  that  un¬ 
dertakes  a  successful  innovation.  If  a  firm  copies  a  patented  innovation,  it  has  to  make  a 
payment  (compulsory  license  fee)  r)  to  the  holder  of  the  patent.  We  discuss  the  relationship 
between  this  payment  and  licensing  fees  in  the  next  section.  An  appropriately-designed 
patent  system  (i.e.,  the  appropriate  level  of  77)  can  achieve  two  objectives  simultaneously. 
First,  it  can  allow  firms  to  copy  others  when  it  is  socially  beneficial  for  the  knowledge  cre¬ 
ated  by  innovations  to  spread  to  others  (and  prevent  it  when  it  is  not  beneficial).  Second, 
it  can  provide  compensation  to  innovators,  so  that  incentives  to  free-ride  on  others  are 
weakened.  In  particular,  when  staggered  experimentation  is  optimal,  a  patent  system  can 
simultaneously  provide  incentives  to  one  firm  to  innovate  early  and  to  the  other  firm  to 
copy  an  existing  innovation.  When  rj  is  chosen  appropriately,  the  patent  system  provides 
incentives  for  the  ex  post  transfer  of  knowledge.  However,  more  crucially,  it  also  encour¬ 
ages  innovation  because  an  innovation  that  is  copied  becomes  more  profitable  than  copying 
another  innovation  and  paying  the  patent  fee.  The  key  here  is  that  the  incentives  provided 
by  the  patent  system  are  “conditional”  on  whether  the  other  firm  has  experimented  or 
not,  and  thus  induce  an  “asymmetric”  response  from  the  two  firms.  This  makes  innovation 
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relatively  more  profitable  when  the  other  firm  copies  and  less  profitable  when  the  other 
firm  innovates.  This  incentive  structure  encourages  one  of  the  firms  to  be  the  innovator 
precisely  when  the  other  firm  is  copying.  Consequently,  the  resulting  equilibria  resemble 
asymmetric  equilibria.  Moreover,  these  asymmetric  incentives  imply  that,  when  the  patent 
system  is  designed  appropriately,  a  symmetric  equilibrium  no  longer  exists.  It  is  less  prof¬ 
itable  for  a  firm  to  innovate  when  the  other  firm  is  also  innovating,  because  innovation  no 
longer  brings  patent  revenues.  Conversely,  it  is  not  profitable  for  a  firm  to  wait  when  the 
other  firm  waits,  because  there  is  no  innovation  to  copy  in  that  case. 

Our  main  result  in  this  section  formalizes  these  ideas.  We  state  this  result  in  the 
following  proposition  and  then  provide  most  of  the  proof,  which  is  intuitive,  in  the  text. 

Proposition  8.  Consider  the  model  with  two  firms.  Suppose  that  Assumption  1  holds. 
Then: 

1 


2 

T)  >  n2  -  pill. 

That  is,  there  exists  a  unique  equilibrium  in  which  both  firms  immediately  experiment. 


When  (3.3)  holds,  a  patent  system  with 


7]  £ 


(1  T-  ,n2-Pn, 


(which  is  feasible  in  view  of  (3.3)),  implements  the  optimal  allocation,  which  involves 
staggered  experimentation,  in  all  equilibria.  That  is,  in  all  equilibria  one  firm  ex¬ 
periments  first,  and  the  other  one  copies  a  successful  innovation  and  experiments 
immediately  following  an  unsuccessful  experimentation. 

When  (3.3)  does  not  hold,  then  the  optimal  allocation,  which  involves  simultaneous 
experimentation,  is  implemented  as  the  unique  equilibrium  by  a  patent  system  with 
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Let  us  start  with  the  first  claim  in  Proposition  8.  We  outline  the  argument  for  why 

V  <  fl2  —  pfiL  implies  that  there  exists  an  equilibrium  with  staggered  experimentation,  and 

V  >  L— ensures  that  other  equilibria,  which  involve  delayed  experimentation,  are  ruled 
out.  Observe  that  since  77  <  fl2  —  pTl\,  the  equilibrium  involves  copying  of  a  successful 
innovation  by  a  firm  that  has  not  acted  yet.  However,  incentives  for  delaying  to  copy  are 
weaker  because  copying  now  has  an  additional  cost  77,  and  innovation  has  an  additional 
benefit  77  if  the  other  firm  is  imitating.  Suppose  that  firm  ~  1  will  innovate  at  some  date 
T  >  0  (provided  that  firm  i  has  not  done  so  until  then).  Then  the  payoffs  to  firm  i  when 
it  chooses  experimentation  and  waiting  are 

experiment  now  =  p  (n2  +  77) 

wait  =  e~rT  (p  (n2  -  77)  +  (1  -  p)pU1) . 

It  is  clear  that  for  any  T  >  0,  experimenting  is  a  strict  best  response,  since 

P  (n2  +  77)  >  P  (n2  -  77)  +  (1  -  p)  pUx 

given  that  77  >  — So  experimenting  immediately  against  a  firm  that  is  waiting  is 
optimal.  To  show  that  all  equilibria  implement  the  optimal  allocation,  we  also  need  to 
show  that  both  firms  experimenting  immediately  is  not  an  equilibrium.  Suppose  they  did 
so.  Then  the  payoff  to  each  firm,  as  a  function  of  whether  they  experiment  or  wait,  would 
be 


experiment  now  =  p[Ii 

wait  =  p(n2 -77)  + (1 -p)plli. 
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Waiting  is  a  strict  best  response  since 


p  (n2  -  rj)  4-  (1  -  p)  pill  >  pill 

which  holds  in  view  of  the  fact  that  p  <  II2  -  pUi-  This  argument  makes  it  intuitive  that 
patents  induce  an  equilibrium  structure  without  delay:  waiting  is  (strictly)  optimal  when 
the  other  firm  is  experimenting  immediately  and  experimenting  immediately  is  (strictly) 
optimal  when  the  other  firm  is  waiting.  To  establish  this  claim  formally,  we  need  to  prove 
that  there  are  no  mixed  strategy  equilibria.  This  is  done  in  the  next  lemma. 

Lemma  2.  When  equation  (3.3)  holds,  there  does  not  exist  any  equilibrium  with  mixing. 

Proof.  Let  us  write  the  expected  present  discounted  value  of  experimenting  at  time  t  for 
firm  i  when  firm  ~  i  experiments  at  the  flow  rate  A  (t)  as  in  (2.7)  in  the  proof  of  Proposition 
3  except  that  we  now  take  patent  payments  into  account  and  use  equation  (3.3)  so  that 
copying  a  successful  innovation  is  still  profitable.  This  expression  is 

V  (t)  =  [  A  {z)  e-m[z)e~TZ  [p  (n2  -  p)  +  (1  -  p)  pIL]  dz  +  e”m(t)e'rtp  (II2  +  rj) , 

Jo 

where  m  (t)  is  given  by  (2.6)  in  the  proof  of  Proposition  3.  This  expression  must  be  constant 
for  all  t  in  the  support  of  the  mixed-strategy  equilibrium.  The  argument  in  the  proof  of 
Proposition  3  establishes  that  A  (t)  must  satisfy 

X(t)  [p(n2  -  p)  +  (1  -p)plli]  =  (r  +  \{t))p  (n2  +  rj)  ■ 

It  can  be  verified  easily  that  since  rj  >  ^  ^ni  this  equation  cannot  be  satisfied  for  any 
A  (t)  €  R+  (for  any  t).  Therefore,  there  does  not  exist  any  equilibrium  with  mixing.  ■ 
Let  us  next  turn  to  the  second  claim  in  the  proposition.  Suppose  that  (3.3)  is  not 
satisfied  and  let  p  >  n2  -  pIL .  Then  it  is  not  profitable  for  a  firm  to  copy  a  successful 
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innovation.  Therefore,  both  firms  have  a  unique  optimal  strategy  which  is  to  experiment 
immediately,  which  coincides  with  the  optimal  allocation  characterized  in  Proposition  6. 

The  intuition  for  the  results  in  Proposition  8  can  also  be  obtained  by  noting  that  the 
patent  system  is  inducing  experimenters  to  internalize  the  externalities  that  they  create. 
Let  us  focus  on  part  1  and  suppose  that  firm  1  experiments  while  firm  2  delays  and  copies 
a  successful  innovation  by  firm  1.  In  this  case,  the  social  surplus  is  equal  to  2pn2  + 
(1  Firm  1  only  receives  pU2  without  a  patent,  and  if  it  were  to  deviate  and 

delay  experimentation,  firm  2  would  instead  receive  pll2.  Thus  to  internalize  the  positive 
externality  that  it  is  creating,  firm  1  needs  to  be  compensated  for  (1  -  p)pUi.  A  licensee 
fee  of  rj  >  — ■ achieves  this,  since  by  experimenting  firm  1  receives  this  license  fee  with 
probability  p  and  by  delaying,  it  would  have  had  to  pay  the  same  license  fee  with  probability 
p  (and  thus  2 prj  >  (1  -  p)pl[l).  The  requirement  that  rj  <  U2  -  pji\  then  simply  ensures 
that  firm  2  indeed  wishes  to  copy  the  innovation  despite  the  license  fee. 

The  preceding  discussion  and  Proposition  8  show  how  an  appropriately-designed  patent 
system  can  be  useful  by  providing  stronger  incentives  for  experimentation.  When  simul¬ 
taneous  experimentation  by  all  parties  is  socially  beneficial,  a  patent  system  can  easily 
achieve  this  by  making  copying  (or  “free-riding”)  unprofitable.  On  the  other  hand,  when 
ex  post  transfer  of  knowledge  is  socially  beneficial,  the  patent  system  can  instead  ensure 
this  while  also  preventing  delays  in  all  equilibria.  It  is  important  to  emphasize  that,  in 
the  latter  case,  the  patent  system  provides  such  incentives  selectively ,  so  that  only  one 
of  the  firms  engages  in  experimentation  and  the  other  firm  potentially  benefits  from  the 
innovation  of  the  first  firm.  In  contrast  to  patents,  simple  subsidies  to  research  could  not 
achieve  this  objective.  This  is  stated  in  the  next  proposition  and  highlights  the  particular 
utility  of  a  patent  system  in  this  environment. 

Proposition  9.  Suppose  that  equation  (3.3)  holds.  Consider  a  direct  subsidy  w  >  0  given 
to  a  firm  that  experiments.  There  exists  no  w  >  0  such  that  all  equilibria  with  subsidies 
correspond  to  the  optimal  allocation. 
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Proof.  This  is  straightforward  to  see.  If  w  >  T\  >-p\\i,  there  exists  an  equilibrium  in  which 
both  firms  experiment  immediately  and  if  w  <  IT2  —  pIT,  the  symmetric  mixed-strategy 
equilibrium  with  delayed  experimentation  survives.  ■ 

It  is  also  clear  that  the  same  argument  applies  to  subsidies  to  successful  innovation  or 
any  combination  of  subsidies  to  innovation  and  experimentation. 

3.3  Patents  and  License  Fees 

The  analysis  in  the  previous  section  assumed  that  a  firm  can  copy  a  successful  innovation 
and  in  return  it  has  to  make  some  pre-specified  payment  (compulsory  license  fee)  77  to  the 
original  innovator.  In  practice  patents  often  provide  exclusive  rights  to  the  innovator,  who 
is  then  allowed  to  license  its  product  or  discovery  to  other  firms.  If  so,  the  license  fee  rj 
would  need  to  be  negotiated  between  the  innovator  and  the  (potential)  copying  firm  rather 
than  determined  in  advance.  While  such  voluntary  licensing  is  an  important  aspect  of  the 
patent  system  in  practice,  it  is  not  essential  for  the  theoretical  insights  we  would  like  to 
emphasize. 

To  illustrate  this,  let  us  suppose  that  the  copying  firm  is  developing  a  different  but  highly 
substitutable  product  to  the  first  innovation.  Suppose  further  that  the  patent  system  gives 
exclusive  rights  to  the  innovator  but  if  the  second  firm  copies  a  successful  innovation,  the 
court  system  needs  to  determine  damages.  How  the  court  system  functions  is  also  part  of  the 
patent  system.  In  particular,  suppose  that  if  a  firm  copies  a  successful  innovation  without 
licensing  and  the  innovator  brings  a  lawsuit,  it  will  succeed  with  probability  p  £  (0, 1)  and 
the  innovator  will  receive  damages  equal  to  k  (nx  —  n2),  where  k  >  0.  We  ignore  legal 
fees.  Given  this  legal  environment,  let  us  interpret  77  as  a  license  fee  negotiated  between 
the  potential  copying  firm  and  the  innovator.  For  simplicity,  suppose  that  this  negotiation 
can  be  represented  by  a  take-it-or-leave-it  offer  by  the  innovator  (this  has  no  effect  on  the 
conclusions  of  this  section).  If  the  two  firms  agree  to  licensing,  their  joint  surplus  is  2n2.  If 
they  disagree,  then  the  outside  option  of  the  copying  firm  is  max  {jffli;  n2  —  pn  (Hi  -  n2)}, 
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where  the  max  operator  takes  care  of  the  fact  that  the  best  alternative  for  the  “copying” 
firm  may  be  to  experiment  if  there  is  no  explicit  licensing  agreement.  Without  licensing, 
the  innovator  will  receive  an  expected  return  of  II2  +  pn  (ITi  —  n2)  if  n2  -  pn  (IT  -  n2)  > 
pfli  and  III  otherwise.  This  implies  that  the  negotiated  license  fee,  as  a  function  of  the 
parameters  of  the  legal  system,  will  be 


V  (P,  k)  =  < 


pk  (it  -  n2) 
n2  -  piii 

00 


if  piii  <  n2  -  pn  (iii  -  n2) , 

if  piii  >  II2  —  pn  (Iii  —  n2)  and  2II2  >  Eli, 
otherwise, 


where  00  denotes  a  prohibitively  expensive  license  fee,  such  that  no  copying  takes  place. 
Clearly,  by  choosing  p  and  k,  it  can  be  ensured  that  p  (p,  k)  is  greater  than  II2  —  piii  when 
(3.3)  does  not  hold  and  is  between  (1  -  p)  II x / 2  and  n2-plli  when  it  holds.  This  illustrates 
how  an  appropriately-designed  legal  enforcement  system  can  ensure  that  equilibrium  license 
fees  play  exactly  the  same  role  as  the  pre-specified  patent  fees  did  in  Proposition  8. 


48 


Chapter  4 


Firms  with  Heterogeneous 
Information 


Fort  the  remainder,  we  relax  the  assumption  that  all  firms  receive  signals  with  identical 
precision.  Instead,  now  signal  quality  differs  across  firms.  We  continue  to  assume  that  each 
firm  receives  a  positive  signal  about  a  single  project.  But  the  information  content  of  these 
signals  differs.  We  parameterize  signal  quality  by  the  probability  with  which  the  indicated 
project  is  successful  and  denote  it  by  p  (or  by  pt  for  firm  z).  We  distinguish  two  cases. 
First,  we  discuss  the  case  when  signals  are  publicly  known  (we  limit  the  discussion  to  two 
firms)  and,  then,  we  study  the  case  when  the  signals  are  drawn  from  a  known  distribution 
represented  by  the  cumulative  distribution  function  G  (p)  ( G  is  assumed  to  have  strictly 
positive  and  continuous  density  g  (p)  over  its  support  [a,  b]  C  [0, 1])  and  the  realization  of 
p  for  each  firm  is  independent  of  the  realizations  for  others  and  is  private  information.  For 
the  case  of  private  signals  we  discuss  the  case  of  two  firms  in  the  main  text  and  relegate  a 
discussion  on  the  extension  to  multiple  firms  to  Appendix  B.  Finally,  throughout  we  focus 
on  the  equivalent  of  symmetric  equilibria  where  strategies  do  not  depend  on  firm  identity. 
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4.1  Publicly  Known  Signals 

Let  pi ,  p2  denote  the  signals  of  firms  1  and  2  respectively.  We  also  impose: 

Assumption  3. 

7Ti  >  7T2  >  min {pi,p2}  • 


Note  that  Assumption  3  implies  that  firm  i  =  argmin{p!,p2}  would  find  it  optimal  to 
copy  firm  ~  i,  if  the  latter  was  successful  at  experimentation.  Also,  note  that  when 
min{pi,p2}  •  tti  >  7t2  the  structure  of  the  equilibrium  is  straightforward.  Let  us  consider 
the  following  two  cases:  (1)  max{pi,p2}  •  7Ti  >  tx2  and  (2)  max{p1)p2}  •  7 iq  <  ix2.  The  next 
proposition  characterizes  the  unique  equilibrium  (in  fully  mixed  strategies  prior  to  any 
experimentation)  in  both  cases  (the  proof  is  omitted  as  it  uses  similar  arguments  to  that  of 
Proposition  3).  For  the  remainder  of  the  section,  let  pmax  =  max{pi,p2},  Pmm  =  min {pi,p2} 
and  similarly  irnax  =  arg max{pl5p2}  and  fmin  =  argmin{pi,p2}. 

Proposition  10.  Suppose  that  Assumption  3  holds.  Then,  there  exists  a  unique  equilibrium 
in  fully  mixed  strategies  prior  to  any  experimentation.  In  particular: 

(1)  Suppose  pmax  •  7!^  <  7r2.  Then,  in  the  unique  fully  mixed  equilibrium,  firm  1  uses  the 

constant  flow  rate  of  experimentation  and  firm  2  uses  the  rate 

A2  =  jjZpoyplPf  (^2_pi)g-  Firm  i  immediately  copies  a  successful  innovation  by  firm  ~  i 
and  experiments  if  ~  i  experiments  unsuccessfully. 

(2)  Suppose  max{pi,p2}  •  tt x  >  7t2.  Then,  in  the  unique  fully  mixed  equilibrium,  firm  imin 
uses  the  constant  flow  rate  of  experimentation  Amin  =  and  flrm  imax  uses  fhe  raie 
Amax  =  (p-p^”)pmax  •  Firm  imin  immediately  copies  a  successful  innovation  by  firm 
imax  and  experiments  if  imax  experiments  unsuccessfully.  On  the  other  hand,  if  imin 
experiments  first,  then  zmax  experiments  with  its  own  research  project  (does  not  copy 
the  potential  innovation). 

It  is  worth  noting  that  when  max{pi,p2}-7ri  >  7t2,  firm  irnax  delays  experimentation  not 


50 


to  copy  a  potential  innovation  by  firm  Vin  but  so  as  not  to  get  copied  by  imin.  Proposition 
11  is  analogous  to  Proposition  6  and  describes  the  optimal  allocation  in  this  setting  (proof 
is  omitted). 


Proposition  11.  Suppose  that 


2/3  >  1  +  Pmin , 


(4.1) 


then  the  optimal  allocation  involves  staggered  experimentation,  that  is,  experimentation 
by  firm  imax  first  and  copying  of  successful  innovations.  If  (f.l)  does  not  hold,  then  the 
optimal  allocation  involves  immediate  experimentation  by  both  firms.  When  2/3  =  1  +  Pmin, 
both  staggered  experimentation  and  immediate  experimentation  are  socially  optimal. 

Moreover,  we  can  show  that  a  patent  system  with: 


V  e 


min 


jn2  PmaxH-l) 


(1  Pmin)  PmaxHl  (Pmax  Pmin)  H2 
Pi  +P2 


implements  the  optimal  allocation  in  all  equilibria,  when  (4.1)  holds.  When  (4.1)  does 
not  hold,  then  the  optimal  allocation  is  implemented  as  the  unique  equilibrium  by  a  patent 
system  with  77  >  n2-pminni  (the  claim  follows  by  similar  arguments  to  those  in  Proposition 
8).  An  interesting  feature  of  the  optimal  allocation  illustrated  by  Proposition  11  is  that 
it  involves  a  monotonicity ,  whereby  the  firm  with  the  strongest  signal  experiments  earlier 
(no  later)  than  the  firm  with  the  weaker  signal.  Yet,  this  monotonicity  does  not  necessarily 
hold  at  equilibrium,  since  there  is  a  positive  .probability  that  the  firm  with  the  weaker  signal 
(imin)  experiments  before  the  firm  with  the  stronger  signal  (imax). 

4.2  Private  Signals 

For  the  remainder,  we  assume  that  p’s  are  drawn  independently  from  a  known  distri¬ 
bution  with  cumulative  distribution  function  G(p)  and  continuous  density  g(p)  over  its 
support.  As  the  title  of  the  section  indicates,  the  realization  of  p’s  are  private  information. 
We  start  with  the  following  lemma,  which  follows  from  the  definition  of  ft  in  (2.1).  It  will 
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play  an  important  role  in  the  analysis  that  follows  (proof  omitted). 

Lemma  3.  Suppose  that  firm  ~  i  has  innovated  successfully.  If  p{  >  (3,  firm  i  prefers  to 
experiment  with  its  own  project.  If  pi  <  f3,  firm  i  prefers  to  copy  a  successful  project. 

Proposition  12  below  provides  a  characterization  of  the  unique  symmetric  equilibrium 
with  two  firms.  We  show  that  the  equilibrium  takes  the  following  form:  firms  with  strong 
signals  (in  particular,  p  >  (3)  experiment  immediately,  while  those  with  weaker  signals  (i.e. , 
P  <  (3)  experiment  at  time  r(p)  with  t(3)  —  0,  unless  there  has  been  experimentation  at 
any  earlier  time.  Function  r(p)  is  strictly  decreasing  and  maps  signals  to  time  of  exper¬ 
imentation  provided  that  the  other  player  has  not  yet  experimented.  The  proof  of  the 
proposition  uses  a  series  of  lemmas  and  is  relegated  to  the  Appendix. 


Proposition  12.  Let  the  support  of  G  be  [ a,b ]  C  [0, 1]  and  define  b  =  min{/3,  b}  and 


r(p)  = 


r(3G  (b) 


log  G(b)  (l  -  b)  -  log  G(p)  (1  -p)+  j  log  G(z)dz 

J  V 


(4.2) 


Then  the  unique  symmetric  equilibrium  involves: 


,  .  f  0  ifp  >  P 

T  (P)  =  S 

[  f{p)  ifpe[a,p) 

That  is,  firms  with  p  >  (3  experiment  immediately  and  firms  with  p  e  [a,  (3)  experiment  at 
time  f  (p)  unless  there  has  been  an  experimentation  att  <  f(p).  If  there  is  experimentation 
at  t  <  T  (p) ,  then  a  firm  with  p  £  [a,  f3)  copies  it  if  the  previous  attempt  was  successful  and 
experiments  immediately  if  it  was  unsuccessful. 

A  particularly  simple  example  to  illustrate  Proposition  12  is  obtained  when  G  is  uniform 
over  [a,  b]  for  0  <  a  <  b  <  f3.  In  that  case 


r  (p)  =  —  [p  -  logp  -  6  +  log b]  for  all  p  £  [a,  b] . 


(4.3) 
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An  interesting  feature  of  the  symmetric  equilibria  in  this  case  is  evident  from  (4.3):  for  a 
arbitrarily  close  to  0,  experimentation  may  be  delayed  for  arbitrarily  long  time.  It  can  be 
verified  from  (4.2)  that  this  is  a  general  feature  (for  types  arbitrarily  close  to  to  the  lower 
support  a,  -  log  (7  (p)  is  arbitrarily  large). 

4.3  Welfare 

In  this  section,  we  discuss  welfare  in  the  environment  with  private,  heterogeneous  signals 
studied  in  the  previous  section.  In  particular,  consider  a  social  planner  that  is  interested 
in  maximizing  total  surplus  (as  in  Section  3.1.  What  the  social  planner  can  achieve  will 
depend  on  her  information  and  on  the  set  of  instruments  that  she  has  access  to.  For 
example,  if  the  social  planner  observes  the  signal  quality,  p,  for  each  firm,  then  she  can 
achieve  a  much  better  allocation  than  the  equilibrium  characterized  above.  However,  it 
is  more  plausible  to  limit  the  social  planner  to  the  same  information  structure.  In  that 
case,  the  social  planner  will  have  to  choose  either  the  same  equilibrium  allocation  as  in  the 
symmetric  equilibria  characterized  in  the  previous  two  sections,  or  she  will  implement  an 
asymmetric  equilibrium,  where  one  of  the  firms  is  instructed  to  experiment  first  regardless 
of  its  p  (this  cannot  be  conditioned  on  p  since  p  is  private  information).1 

More  specifically,  let  us  focus  on  the  economy  with  two  firms  and  suppose  that  the  sup¬ 
port  of  G  is  [a,  6]  C  [0,/3].  In  this  case,  without  eliciting  information  about  the  realization 
of  firm  types,  the  p’s,  the  planner  has  three  strategies. 

1.  Staggered  asymmetric  experimentation:  in  this  case,  the  social  planner  would  instruct 
one  of  the  firms  to  experiment  immediately  and  then  have  the  other  firm  copy  if  there 
is  a  successful  innovation.  Since  the  social  planner  does  not  know  the  p’s,  she  has 
to  pick  the  experimenting  firm  randomly.  We  denote  the  social  surplus  generated  by 
this  strategy  by  Sf. 

Wet  another  alternative  is  to  specify  exactly  the  instruments  available  to  the  planner  and  characterize 
the  solution  to  a  mechanism  design  problem  by  the  planner.  However,  if  these  instruments  allow  messages 
and  include  payments  conditional  on  messages,  the  planner  can  easily  elicit  the  necessary  information  from 
the  firms. 
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2.  Staggered  equilibrium  experimentation :  alternatively,  the  social  planner  could  let  the 
firms  play  the  symmetric  equilibrium  of  the  previous  two  sections,  whereby  a  firm  of 
type  p  will  experiment  at  time  r  (p)  unless  there  has  previously  been  an  experimen¬ 
tation  by  the  other  firm.  We  denote  the  social  surplus  generated  by  this  strategy  by 
SE,  since  this  is  the  same  as  the  equilibrium  outcome.2 

3.  Simultaneous  experimentation :  in  this  case,  the  social  planner  would  instruct  both 
firms  to  experiment  immediately.  We  denote  the  social  surplus  generated  by  this 
strategy  by  S% . 

The  social  surpluses  from  these  different  strategies  are  given  as  follows.  In  the  case  of 
staggered  asymmetric  experimentation,  we  have 

+  (1_p,)(i 

In  contrast,  the  expected  surplus  from  the  unique  (mixed-strategy)  symmetric  equilibrium 
can  be  written  as 


p2dG  (p2)  n 


dG(Pl). 


S 


Pi2n2 


sE  = 


g— rr(max{pi  ,P2  }) 


max{pi,p2}2n2  +  (1 


max {pi,p2})  f  min-fpi,^}!!! 
J  a 


dG  (pi)  dG  (p2) . 


Intuitively,  this  expression  follows  by  observing  that  in  the  equilibrium  as  specified  in 
Proposition  20,  the  firm  with  the  stronger  signal  (higher  p)  will  experiment  first,  so  there 
will  be  delay  until  max {pi,p2}-  At  that  point,  this  firm  will  succeed  with  probability 
max  {pi,p2},  in  which  case  the  second  firm  will  copy.  If  the  first  firm  fails  (probability  1  - 

2Without  eliciting  information  about  firm  types  and  without  using  additional  instruments,  the  social 
planner  cannot  implement  another  monotone  staggered  experimentation  allocation.  For  example,  she  could 
announce  that  if  there  is  no  innovation  until  some  time  t  >  0,  one  of  the  firms  will  be  randomly  forced 
to  experiment.  But  such  schemes  will  not  preserve  mono  tonicity,  since  at  time  t,  it  may  be  the  firm  with 
lower  p  that  may  be  picked  for  experimentation.  In  the  next  section,  we  discuss  how  she  can  implement 
better  allocations  using  patent  payments. 
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max  {p [ ,  p2 } ) •  then  the  second  firm  experiments  and  succeeds  with  probability  min  {pi,^}- 
Since  both  pi  and  p-2  are  randomly  drawn  independently  from  G,  we  integrate  over  G  twice 
to  find  the  expected  surplus. 

The  surplus  from  simultaneous  experimentation,  on  the  other  hand,  takes  a  simple  form 
and  is  given  by 

S[  =  2Th  [  pdG  (p) , 

J  a 

since  in  this  case  each  firm  is  successful  and  generates  payoff  Tlx  with  probability  p  dis¬ 
tributed  with  distribution  function  G. 

In  this  case,  there  is  no  longer  any  guarantee  that  max  {S'f ,  <S'Lf}  >  SE .  Therefore,  the 
symmetric  equilibrium  may  generate  a  higher  expected  surplus  (relative  to  allocations  in 
which  the  social  planner  does  not  have  additional  instruments).  To  illustrate  this,  let  us 
consider  a  specific  example,  where  p  has  a  uniform  distribution  over  [0,  3).  In  this  case, 
staggered  asymmetric  experimentation  gives 


nP 

2p1U2  +  (1  -  Pi)P2^idp2dpi  =  n2 


i 


whereas  simultaneous  experimentation  gives 

fP 

s[  =  /  2Pn1dp  =  pn-i  =  n2. 

Jo 

Comparing  simultaneous  experimentation  and  staggered  asymmetric  experimentation,  we 
can  conclude  that  S'f"  >  SE  whenever  <3  >  2/3  and  Sf  <  SE  whenever  d  <  2/3,  showing 
that,  as  in  the  case  with  common  signals,  either  simultaneous  or  staggered  experimentation 
might  be  optimal.  Next,  we  can  also  compare  these  surpluses  to  SE.  Since  p  is  uniformly 
distributed  in  [0,  j3\,  (4.2)  implies  that 

r(p)  =  ~  [p  -  logp  -P  +  log0]. 


55 


As  a  consequence,  max{p1,p2}  has  a  Beta(2,l)  distribution  (over  [0,  0})  while  min{p1,p2}  is 
distributed  Beta(l,2).  Then  evaluating  the  expression  for  SE,  we  find  that  when  0  <  0  < 
2/3,  SE  >  SE,  so  simultaneous  experimentation  gives  the  highest  social  surplus.  When 
2/3  <  0  <  0*  ~  0.895,  Sf  >  SE,  so  that  staggered  asymmetric  experimentation  gives 
the  highest  social  surplus.  Finally,  when  0*  <  0  <  1,  SE  >  S[  >  5f ,  so  the  symmetric 
equilibrium  gives  higher  social  surplus  than  both  staggered  asymmetric  experimentation 
and  simultaneous  experimentation. 

Finally,  it  is  also  straightforward  to  see  that  by  choosing  G  to  be  highly  concentrated 
around  a  particular  value  p,  we  show  that  the  symmetric  equilibrium  can  be  arbitrarily 
inefficient  relative  to  the  optimal  allocation. 


4.4  Equilibrium  with  Patents 


Equilibria  with  patents  are  also  richer  in  the  presence  of  heterogeneity.  Let  us  again  focus 
on  the  case  in  which  there  are  two  firms.  Suppose  that  there  is  a  patent  system  identical 
to  the  one  discussed  in  Section  3.2,  whereby  a  firm  that  copies  a  successful  innovation  pays 
t]  to  the  innovator.  Let  us  define 


n2  -  v 


IL 


(4.4) 


It  is  clear,  with  a  reasoning  similar  to  Lemma  3,  that  only  firms  with  p  <  pv  will  copy 
when  the  patent  system  specifies  a  payment  of  77.  The  next  proposition  characterizes  the 
structure  of  equilibria  with  patents  (the  proof  is  relegated  to  the  Appendix). 


Proposition  13.  Suppose  that  there  are  two  firms  and  the  patent  system  specifies  a  payment 
V  >  0  for  copying.  Let  p 71  be  given  by  (4.4),  the  support  of  G  be  [a,b]  C  [0, 1],  and  define 
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b  =  min  {b,pv}  and 


f 11  (p)  = 

1 

r(U2  +  p)G{b) 


log  G  ( b )  (ill  —  2 rj  —  6IIi)  —  log  G  ( p )  (III  —  2?7  —  pIT)  +  Hi  f  log  G  ( z )  cfe 


(4.5) 


T/zen  t/ie  unique  symmetric  equilibrium  involves: 

{0  if  p>  pv 

■ 

Tn  (p)  ifpe[a,pv) 

That  is,  firms  withp  >  p v  experiment  immediately  and  firms  ivith  p  G  [a,pv)  experiment  at 
time  Tv  (p)  unless  there  has  been  an  experimentation  at  t  <  fv  (p) . 

Moreover,  a  higher  p  tends  to  reduce  delay.  In  particular: 

•  for  any  rj  >  p  such  that  b  <  pv  and  b  <pv' ,  we  have  r77'  (p)  <  rv  (p)  for  all  p  G  [a,  b\, 
with  strict  inequality  whenever  r77  (p)  >  0; 

•  for  any  rf  such  that  b  >  pn' ,  there  exists  p*  ( rf )  G  [0 ,p77')  such  that  r77  (p)  is  decreasing 
in  p  starting  at  p  =  rf  for  all  p  G  [p*  (p1)  ,pn’]?  with  strict  inequality  whenever 
t71'  (p)  >  0. 

Note  that  the  first  bullet  point  considers  the  case  when  all  firms  would  prefer  to  copy 
a  successful  innovation  than  to  experiment  on  their  own  (since  b  <  pv  <  pn).  whereas 
the  second  bullet  point  considers  the  case  when  there  is  a  positive  probability  that  a  firm 
obtains  a  strong  enough  signal  and  prefers  to  experiment  on  its  own. 

The  result  highlights  an  important  role  of  patents  in  experimentation.  When  p  increases, 
r  (p)  tends  to  become  “steeper”  so  that  there  is  less  delay  and  thus  “time  runs  faster” .  In 
particular,  whenever  p71  <  b,  r  (p)  is  reduced  by  an  increase  in  patent  payments.  When 
p77  >  b ,  this  does  not  necessarily  apply  for  very  low  p’s,  but  is  still  true  for  high  p’s. 
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Overall,  this  result  implies  that  as  in  the  case  with  common  p's,  patents  tend  to  increase 
experimentation  incentives  and  reduce  delay.  In  the  limit,  when  rj  becomes  arbitrarily 
large,  the  equilibrium  involves  simultaneous  experimentation.  Nevertheless,  as  discussed  in 
Section  4.3,  simultaneous  experimentation  may  not  be  optimal  in  this  case. 

Alternatively  (and  differently  from  Proposition  13),  a  patent  system  can  also  be  chosen 
such  that  the  socially  beneficial  ex  post  transfer  of  knowledge  takes  place.  In  particular, 
suppose  that  there  has  been  an  innovation  and  the  second  firm  has  probability  of  success 
equal  to  p.  In  this  case,  social  surplus  is  equal  to  2n2  if  there  is  copying,  and  it  is  equal 
to  III  +  pldi  if  the  second  firm  is  forced  to  experiment.  This  implies  that  to  maximize  ex 
post  social  welfare,  firms  with  p  <  2(3  —  1  should  be  allowed  to  copy,  whereas  firms  with 
p  >  2/3  —  1  should  be  induced  to  experiment.  Clearly,  from  (4.4)  choosing  p  =  ITi  —  n2 
achieves  this.  Naturally,  from  Proposition  13,  this  will  typically  lead  to  an  equilibrium  with 
staggered  experimentation.  This  argument  establishes  the  following  proposition  (proof  in 
the  text). 

Proposition  14.  A  patent  system  with  77  =  Hi  —  n2  induces  the  socially  efficient  copying 
and  experimentation  behavior  for  all  p  £  [ a,b\ ,  but  typically  induces  delayed  experimenta¬ 
tion. 

The  juxtaposition  of  Propositions  13  and  14  implies  that  when  signal  quality  is  heteroge¬ 
neous  and  private  information,  the  patent  system  can  ensure  either  rapid  experimentation 
or  the  socially  beneficial  ex  post  transfer  of  knowledge  (and  experimentation  by  the  right 
types),  but  will  not  typically  be  able  to  achieve  both  objectives  simultaneously.  However, 
appropriately  designed  patents  typically  improve  efficiency,  as  is  stated  in  the  following 
corollary.  Moreover,  note  that  unless  the  types  distribution,  i.e.,  G,  is  skewed  towards  low 
signals,  the  optimal  patent  payment  will  satisfy  p*  >  ^  —  n2. 

Corollary  1.  Suppose  that  there  are  two  firms  and  the  patent  system  specifies  a  payment 
77  >  0  for  copying.  Then,  the  aggregate  payoff  of  the  firms  is  higher  than  the  case  when 
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rj  =  0,  unless  both  firms  have  very  weak  signals,  i.e.,  P\,P2  <  P^iv);  where  p*{rj)  <  pv  is  a 
constant. 
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Chapter  5 


A  Model  of  Information  Exchange  in 
Social  Networks 


We  start  by  presenting  the  model  for  a  finite  set  J\fn  —  {1,  2,  •  •  •  ,  n}  of  agents.  As  we  are 
interested  in  economies  with  a  large  number  of  agents,  we  then  describe  the  limit  economy 
as  n  — »  oo. 

5.1  Actions,  Payoffs  and  Information 

Each  agent  i  G  Nn  chooses  an  action  x,  G  5ft.  Her  payoff  depends  on  her  action  and  an 
underlying  state  of  the  world  8,  which  is  an  exogenous  random  variable.  In  particular,  agent 
i’s  payoff  when  she  takes  action  Xi  and  the  state  of  the  world  is  8  is  given  by  7 r»  =  f(xi,8). 
We  impose  the  following  assumption  on  /: 

Assumption  4.  The  payoff  function  f(x,8)  is  quadratic  in  both  x  and  8.  Moreover,  f  is 
twice- differentiable  and  fxx  <  0,  fxo  0. 

Assuming  that  /  is  quadratic  greatly  simplifies  subsequent  analysis.  We  also  assume 
that  fxx  <  0  (concavity  with  respect  to  one’s  action),  so  as  to  guarantee  that  best  responses 
are  well-defined.  Finally,  fx8  /  0  ensures  that  8  is  relevant  for  the  behavior  of  an  individual. 
To  ease  exposition,  we  let  f(x,  8)  =  tt  —  (x  —  8)2,  where  n  is  a  constant.  It  is  straightforward 
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to  extend  our  results  to  any  function  that  satisfies  Assumption  4. 

The  state  of  the  world  9  is  unknown  and  agents  observe  noisy  signals  about  its  realiza¬ 
tion.  In  particular,  9  is  drawn  from  a  Normal  distribution  with  known  mean  //.  and  variance 
p.  Each  agent  receives  a  private  signal  s{  =  9  +  z{,  where  the  zfs  are  idiosyncratic  noises, 
independent  from  one  another  and  9,  with  common  mean  p,  normalized  to  0,  and  variance 
P- 

5.2  Communication 

Our  focus  is  on  information  aggregation,  when  agents  are  embedded  in  a  social  network 
structure  that  can  be  thought  of  as  imposing  communication  constraints  to  the  agents.  In 
particular,  agent  i  forms  beliefs  about  the  state  of  the  world  from  her  private  signal  Si , 
as  well  as  information  she  can  obtain  from  other  agents  through  a  given  communication 
network  Gn ,  which  will  be  described  shortly.  We  assume  that  time  t  e  [0,  oo)  is  continuous 
and  there  is  a  common  discount  rate  r  >  0.  At  a  given  time  instant  t.  agent  i  decides 
whether  to  take  an  irreversible  action  xt  (and  receive  payoff  f(xu  9)  discounted  by  e~rt)  or 
“wait”  so  as  to  obtain  more  information  from  her  peers.  Throughout  the  rest  of  the  thesis, 
we  say  that  the  agent  “exits”  at  time  f,  if  she  chooses  to  take  an  irreversible  action  at  time 
t.  Discounting  implies  that  an  earlier  exit  is  preferred  to  a  later  one.  In  terms  of  notation, 
we  let  it"  denote  the  utility  of  agent  i,  when  the  number  of  agents  in  the  society  is  n.  From 
above,  a"  =  errt  f(x%,  9 ),  when  agent  i  takes  action  x,t  at  time  t. 

Finally,  we  describe  how  agent  i  obtains  information  from  her  peers  in  the  social  network 
structure.  Let  the  directed  graph  Gn  =  (A fn,£n),  where  J\fn  =  {l,--  ,n}  is  the  set  of 
agents  and  £n  is  the  set  of  directed  edges  with  which  agents  are  linked,  represent  the 
communication  network,  in  which  agents  are  embedded.  We  say  that  agent  j  can  obtain 
information  from  i  or  that  agent  i  can  send  information  to  j  if  there  is  an  edge  from  i  to 
j  in  graph  Gn,  i.e.,  (i,j)  £  £n.  Let  Ift  denote  the  information  set  of  agent  i  at  time  t  and 
Ift  denote  the  set  of  all  possible  information  sets.  Then,  for  every  pair  of  agents  i,j,  such 
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that  (i,j)  G  £n,  we  say  that  agent  j  communicates  with  agent  i  or  that  agent  i  sends  a 
message  to  agent  j,  and  define  the  following  mapping 


m. 


ij,t 


■  Tn 


Ml,  for  (S.j)  €  r 


where  A4”-f  denotes  the  set  of  messages  that,  agent  i  can  send  to  agent  j  at  time  t.  The 
definition  of  rn"  t  captures  the  fact  that  communication  is  directed  and  is  only  allowed 
between  agents  that  are  linked  in  the  communication  network,  i.e.,  j  communicates  with  i 
if  and  only  if  (i,j)  G  En .  The  direction  of  communication  should  be  clear:  when  agent  j 
communicates  with  agent  i,  then  agent  i  sends  a  message  to  agent  j,  that  could  in  principle 
depend  on  the  information  set  of  agent  i  as  well  as  the  identity  of  agent  j.  Importantly, 
we  assume  that  the  cardinality  (“dimensionality”)  of  M\t  is  no  less  than  that  of  T-p  so 
that  communication  can  take  the  form  of  agent  i  sharing  all  her  information  with  agent 
j.  This  has  two  key  implications.  First,  an  agent  can  communicate  (indirectly)  with  a 
much  larger  set  of  agents  than  just  her  immediate  neighbors,  albeit  with  a  time  delay. 
Second,  mechanical  duplication  of  information  can  be  avoided.  For  example,  the  second 
time  agent  j  communicates  with  agent  i,  she  can  repeat  her  original  signal,  but  this  will 
not  be  recorded  as  an  additional  piece  of  information  by  agent  j ,  since  given  the  size  of  the 
message  space  At”  f,  each  piece  of  information  can  be  “tagged”.  This  ensures  that  under 
truthful  communication,  there  need  be  no  confounding  of  new  information  and  previously 
communicated  information.  Figure  5-1  illustrates  the  process  of  information  aggregation 
centered  at  a  particular  agent. 

The  times  at  which  communication  takes  place  are  exponentially  distributed  with  pa¬ 
rameter  A  >  0.1  Let  Tt  denote  the  set  of  times  that  agents  communicated  with  their 


1  Alternatively,  agents  “wake”  up  and  communicate  with  their  neighbors,  when  a  Poisson  clock  with 
rate  A  ticks. 
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(a)  Time  t  —  0.  (b)  First  communication  step. 


(  4) 


®-^(D 


(c)  Second  communication  step. 


Figure  5-1:  The  information  set  of  agent  1  under  truthful  communication. 
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neighbors  up  to  time  t.  That  defines  the  information  set  of  agent  i  at  time  t  >  0  as: 

J”t  =  {sj,  for  all  r  <E  Tt  and  j  such  that  (j,i)  G  £"} 

and  /"0  =  {sj.  In  particular,  the  information  set  of  agent  i  at  time  t  >  0  consists  of  her 
private  signal  and  all  the  messages  her  neighbors  sent  to  i  in  previous  communication  times. 
Agent  V s  action  at  time  t  is  a  mapping  from  her  information  set  to  the  set  of  actions,  i.e., 

alt'.  2^{‘‘wait’’}U5h 

The  tradeoff  between  taking  an  irreversible  action  and  waiting,  should  be  clear  at  this 
point.  An  agent  would  wait,  in  order  to  communicate  with  a  larger  set  of  agents  and  choose 
a  better  action.  On  the  other  hand,  future  is  discounted,  therefore,  delaying  is  costly. 

We  close  the  section  with  a  number  of  definitions.  We  define  a  path  between  agents 
i  and  j  in  network  G"  as  a  sequence  A,  •  •  ■  Jk  of  distinct  nodes  such  that  A  =  i,  %k  —  j 
and  (A,  4+i )  £  En  for  k  £  {1,  •  •  •  ,  K  -  1}.  The  length  of  the  path  is  defined  as  K  -  1. 
Moreover,  we  define  the  distance  of  agent  i  to  agent  j  as  the  length  of  the  shortest  path 
from  i  to  j  in  network  Gn,  i.e., 

distn(i,  j)  =  min{length  of  V  \  V  is  a  path  from  i  to  j  in  Gn}. 

Finally,  the  (indirect)  neighborhood  of  agent  i  at  communication  step  r  is  defined  as 

Blr  =  {j  |  distn{j,i)  <  t}, 

where  =  {?}.  i.e.,  consists  of  all  agents  that  are  at  most  r  links  away  from  agent  i  in 
graph  Gn.  Intuitively,  if  agent  i  waits  for  r  communication  steps  and  all  of  the  intervening 
agents  receive  and  communicate  information  truthfully,  i  will  have  access  to  all  of  the 
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signals  of  the  agents  in  the  set  Bf  . 

5.3  Equilibria  of  the  Information  Exchange  Game 

We  refer  to  the  game  defined  above  as  the  Information  Exchange  Game.  We  next  define 
the  equilibria  of  the  information  exchange  game  Tinfo(Gn),  given  that  the  communication 
network  is  given  by  Gn.  Note  that  we  use  the  standard  notation  cr_ ,•  to  denote  the  strategies 
of  agents  other  than  i.  Also,  we  let  cq  _t  denote  the  vector  of  actions  of  agent  i  at  all  times 
except  t. 

Definition  1.  An  action  strategy  profile  an'*  is  a  pure-strategy  Perfect  Bayesian  Equilib¬ 
rium  of  the  information  exchange  game  Yinfo(Gn )  if  for  every  i  £  J\fn  and  time  t,  off 
maximizes  the  expected  payoff  of  agent  i  given  the  strategies  of  other  agents  cr”’*,  be., 


71,* 


£  arg  max 

2/e{“u!aii”}u3? 


We  denote  the  set  of  equilibria  of  this  game  by  INFO(Gn). 

For  the  remainder,  we  refer  to  a  pure-strategy  Perfect  Bayesian  Equilibrium  simply  as 
equilibrium  (we  do  not  study  mixed  strategy  equilibria). 

Note  that  if  agent  i  decides  to  exit  and  take  an  action  at  time  t,  then  her  optimal  action 
would  be: 

xlt  =  arg  max  E[f(x,9)\  J"t]  =  E[0|i£], 

where  the  second  equality  holds  when  f(x,  6)  =  n  -  (x  -  6)2 .  Thus,  given  that  actions  are 
irreversible,  the  agent’s  decision  problem  reduces  to  determining  the  timing  of  her  action. 
It  is  straightforward  to  see  that  at  equilibrium  an  agent  takes  an  irreversible  action  imme¬ 
diately  after  some  communication  step  concludes.  Thus,  an  equilibrium  strategy  profile  a 
induces  an  equilibrium  timing  profile  where  r"’CT  designates  the  communication  step 
after  which  agent  i  exits  by  taking  an  irreversible  action.  The  r  notation  is  convenient  to 
use  for  the  statement  of  some  of  our  results  below. 
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5.4  Learning  in  Large  Societies 

We  are  interested  in  whether  equilibrium  behavior  leads  to  information  aggregation. 
This  is  captured  by  the  notion  of  “asymptotic  learning” ,  which  characterizes  the  behavior 
of  agents  over  communication  networks  with  growing  size.  We  first  focus  on  learning  over 
a  fixed  communication  network,  i.e. ,  we  study  agents’  decisions  along  equilibria  of  the 
information  exchange  game. 

We  consider  a  sequence  of  communication  networks  { G" } j  where  Gn  =  {Afn,Sn} 
with  J\fn  =  {1,  •  ■  •  ,  n}  and  refer  to  this  sequence  of  communication  networks  as  a  society. 
A  sequence  of  communication  networks  induces  a  sequence  of  information  exchange  games, 
and  with  a  slight  abuse  of  notation  we  use  the  term  equilibrium  to  denote  a  sequence 
of  equilibria  of  the  sequence  of  information  exchange  games,  or  of  the  society  {G'n}^=1. 
We  denote  such  an  equilibrium  by  a  =  {crn}^=1,  which  designates  that  an  6  INFO(Gn) 
for  all  n.  For  any  fixed  n  >  1  and  any  equilibrium  of  the  information  exchange  game 
an  £  INFO(Gn),  we  introduce  the  indicator  variable: 

{1  if  agent  i  takes  an  action  that  is  e-  “close”  to  the  optimal, 

(5.1) 

0  otherwise. 

In  other  words,  M”’e  =  1  if  and  only  if  agent  i  chooses  irreversible  action  xt,  such  that 
\xi  —  x*\  <  e,  where  x*  denotes  the  optimal  action  for  agent  i  given  complete  information 
(i.e.,  knowing  the  realization  of  6). 

Next  definition  introduces  e,  d-asymptotic  learning  for  a  given  society. 


Definition  2.  We  say  that  e,  S-asymptotic  learning  occurs  in  society  {G”}^  along  equi¬ 
librium  a  if  we  have: 


lim 

n—>oo 


-Ed  -mt) 

n 

i=i 
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This  definition2  equates  e,  fi-asymptotic  learning  occurs  with  all  but  an  e-  fraction  of  the 
agents  taking  an  action  that  is  £-“01086”  to  their  optimal  action  (as  the  society  grows 
infinitely  large)  with  probability  at  least  1  —  5.  Moreover,  we  define  perfect  asymptotic 
learning  as  follows. 

Definition  3.  We  say  that  perfect  asymptotic  learning  occurs  in  society  {Gn}ff=1  along 
equilibrium  a  if  we  have: 

>  =  0. 

for  any  e  >  0. 

Perfect  asymptotic  learning  is,  thus,  a  strong  definition  and  requires  all  but  a  negligible 
fraction  of  the  agents  taking  essentially  their  optimal  action. 

5.5  Assumptions  on  the  Information  Exchange  Pro¬ 
cess 

The  communication  model  described  in  Section  5.2  is  fairly  general.  In  particular,  we 
did  not  restrict  the  set  of  messages  that  an  agent  can  send  or  specify  her  information  set. 
Throughout,  we  maintain  the  assumption  that  the  communication  network  Gn  is  common 
knowledge.  Also,  we  focus  on  the  following  three  environments  of  increasing  complexity, 
defined  by  Assumptions  5,  6  and  7  respectively. 

Assumption  5  (Continuous  Communication).  Communication  between  agents  is  contin¬ 
uous  if 

m\t  =  {*<  for  all  i  e  B?r}, 

for  all  agents  i,j  and  time  periods  t,  where  r  =  \Tt\,  i.e.,  the  number  of  times  that  commu- 

2Note  that  we  could  generalize  Definition  2  by  introducing  yet  another  parameter  and  study  e,  <5,  (- 
asymptotic  learning,  in  which  case  we  would  require  that  limn-**,  lim^oo  PCT  ( [A  Ya=\  l1  ~  M"’c)]  >  C)  < 


lim 

n— >  oo 


n 

-  E  a  - 
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nication  took  place  imt.il  time  t. 

This  assumption  is  adopted  as  a  prelude  to  Assumptions  6  and  7,  because  it  is  simpler 
to  work  with  and  as  we  show  the  main  results  that  hold  under  this  assumption,  generalize 
to  the  more  complex  environments  generated  by  Assumptions  6  and  7.  Intuitively,  this 
assumption  compactly  imposes  three  crucial  features:  (1)  As  already  noted,  communication 
takes  place  by  sharing  signals,  so  that  when  agent  j  communicates  with  agent  i  at  time 
t,  then  agent  i  sends  to  j  all  the  information  agent  i  has  obtained  thus  far,  i.e. ,  the 
private  signals  of  all  agents  that  are  at  a  distance  at  most  r  =  \Tt\  from  i  (refer  back  to 
Figure  5-1  for  an  illustration  of  the  communication  process  centered  at  a  particular  agent); 
(2)  Communication  is  continuous  in  the  sense  that  agents  do  not  stop  transmitting  new 
information  even  after  taking  their  irreversible  action.  This  also  implies  that  agents  never 
exit  the  social  network,  which  would  be  a  good  approximation  to  friendship  networks  that 
exist  for  reasons  unrelated  to  communication;  (3)  Agents  cannot  strategically  manipulate 
the  messages  they  sent,  i.e.,  an  agent’s  private  signal  is  hard  information. 

Assumption  6  relaxes  the  second  feature  above,  the  continuous  transmission  of  infor¬ 
mation. 

Assumption  6  (Truthful  Communication).  Communication  between  agents  is  truthful, 
i.e., 

=  {se  for  all  £  E  /"J, 

for  all  agents  i,j  and  time  periods  t. 

Intuitively,  it  states  that  when  an  agent  takes  an  irreversible  action,  then  she  no  longer 
obtains  new  information  and,  thus,  can  only  communicate  the  information  she  has  obtained 
until  the  time  of  her  decision.  The  difference  between  Assumptions  5  and  6  can  be  seen 
from  the  fact  that  in  Assumption  6  we  write  Ift  as  opposed  to  Bfr ,  which  implies  that 
an  agent  stops  receiving  and  subsequently  communicating  new  information  as  soon  as  she 
takes  an  irreversible  action.  We  believe  that  this  assumption  is  a  reasonable  approximation 
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to  communication  in  social  networks,  since  an  agent  that  engages  in  information  exchange 
to  make  a  decision  would  have  weaker  incentives  to  collect  new  information  after  reaching 
that  decision.  Nevertheless,  she  can  still  communicate  the  information  she  had  previously 
obtained  to  other  agents,  We  call  this  type  of  communication  truthful  to  stress  the  fact 
that  the  agents  cannot  strategically  manipulate  the  information  they  communicate.3 

F inally,  we  discuss  the  implications  of  relaxing  Assumption  6  by  allowing  strategic  com¬ 
munication,  i.e.,  when  agents  can  strategically  he  or  babble  about  their  information.  In 
particular,  we  replace  Assumption  6  with  Assumption  7. 

Assumption  7  (Strategic  Communication).  Communication  between  agents  is  strategic  if 

><«  e  sI'A 


for  all  agents  i,j  and  time  periods  t. 

This  assumption  makes  it  clear  that  in  this  case  the  messages  need  not  be  truthful. 
Allowing  strategic  communication  adds  an  extra  dimension  in  an  agent’s  strategy,  since 
the  agent  can  choose  to  “lie”  about  (part)  of  her  information  set  with  some  probability,  in 
the  hope  that  this  increases  her  expected  payoff.  Note  that,  in  contrast  with  “cheap  talk” 
models,  externalities  in  our  framework  are  purely  informational  as  opposed  to  payoff  rele¬ 
vant.  Thus,  an  agent  may  have  an  incentive  to  “lie”  as  a  means  to  obtain  more  information 
from  the  information  exchange  process. 


3Yet  another  variant  of  this  assumption  would  be  that  agents  exit  the  social  network  after  taking  an 
action  and  stop  communicating  entirely.  In  this  case,  the  results  are  again  similar  if  their  action  is  observed 
by  their  neighbors.  If  they  exit  the  social  network,  stop  communication  altogether  and  their  action  is  not 
observable,  then  the  implications  are  different.  We  do  not  analyze  these  variants  in  the  current  version  to 
save  space. 
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Chapter  6 


Learning  and  Efficient 
Communication 


In  this  chapter,  we  present  our  main  results  on  learning  and  discuss  their  implications  for 
the  aggregate  welfare  of  the  agents.  Before  doing  so,  we  discuss  the  decision  problem  of  a 
single  agent,  i.e. ,  determining  the  best  time  to  take  an  irreversible  action  given  that  the 
rest  of  the  agents  behave  according  to  strategy  profile  a.  Later,  we  contrast  the  single 
agent  problem  with  that  of  a  social  planner,  whose  objective  is  to  maximize  the  expected 
aggregate  welfare. 

6.1  The  single  agent  problem 

Given  our  normality  assumption  on  both  8  and  the  private  signals  as  well  as  quadratic 
preferences,  i.e.,  f(x,  8)  =  n  —  (x  —  8)2,  the  expected  payoff  of  agent  i  taking  an  action  after 
observing  k  private  signals  (including  her  own)  is  given  by: 

1 

7r_  1/p2  +  1/p2  k' 

discounted  by  the  appropriate  factor.  Let  a)  denote  agent  V s  optimal  value  function 

at  information  set  /”f ,  when  the  rest  of  the  agents  behave  according  to  strategy  profile  a . 


71 


Then,  by  the  principle  of  optimality, 


(when  she  takes  the  optimal  irreversible  action), 
(when  she  decides  to  wait,  i.e.,  x  =  “wait”), 

where  k™f  denotes  the  number  of  distinct  private  signals  agent  i  has  observed  up  to  time 
t.  The  first  line  is  equal  to  the  expected  payoff  for  the  agent  when  she  chooses  the  optimal 
irreversible  action  under  information  set  Ift,  i.e.,  E[/9|/”J,  and  she  has  observed  k”f  private 
signals,  while  the  second  line  is  equal  to  the  discounted  expected  continuation  payoff.  For 
the  latter,  we  have  that  with  probability  A dt,  communication  takes  place  in  time  interval 
[t,t  +  dt],  thus  the  information  set  of  agent  i  expands;  with  probability  (1  —  A  dt)  there  is  no 
communication  and  the  value  function  for  agent  i  remains  unchanged.  If  communication 
takes  place  in  interval  [t,t  +  dt\,  then  agent  i  observes  -  \B™’lj\  additional  signals, 

where  recall  that  \Tt\  denotes  the  number  of  communication  rounds  up  to  time  t  and 
denotes  the  set  of  agents  that  are  distance  at  most  r  from  i  and  can  be  reached  by  i  under 
profile  cr,  i.e.,  there  exists  a  directed  path  from  agent  i  to  each  of  those  agents  in  Gn, 
that  has  length  at  most  r  and  t”,(T  >  r  —  x,  where  x  =  dist(i,j)  (which  ensures  that  the 
information  will  be  transmitted  by  j). 


Vi{rlva)  =  max 


7T  — 


l/p2+l/p2k 

~ rdt  E[Vi(7” 


77, ,  cr 


t+dt  i 


0] 


Note  that  since  we  assume  that  signals  are  statistically  equivalent  and  independent  (as 
well  as  truthful),  the  value  function  can  simply  be  expressed  as  a  function  of  the  number 
of  distinct  signals  in  /”t,  k”f  and  profile  a.  The  agent  will  choose  to  take  an  irreversible 
action  and  not  wait  if 


7T  — 


>  +  |B“|'|+1|  -  I B”("  ,U)  +  (1  -  \dt)Vi(k?f  ,<j))} 
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Thus,  we  obtain  that  the  agent  should  choose  not  to  wait  if: 


v,(K 


n,i 7 


i,\Tt\+l 


Rn,<r 


w)  < 


r  +  A 
A 


7T  — 


i/p2  +  i/p2Kt 


This  establishes  the  following  lemma,  which  states  that  an  agent’s  optimal  action  takes  the 
form  of  a  threshold  rule:  an  agent  decides  to  take  an  irreversible  action  at  time  t  as  long 
as  she  has  observed  more  that  A;"'’*}  t>  private  signals. 


Lemma  4.  Let  Assumption  6  hold.  Given  communication  network  Gn  and  equilibrium 
a  £  INFO(Gn),  there  exists  a  sequence  of  signal  thresholds  for  each  agent  i,  {krf*lt)}T=v, 
that  depend  on  the  current  time  period,  the  agent  i,  the  communication  network  Gn  and 
a  such  that  agent  i  maximizes  her  expected  utility  at  information  set  Ift  by  taking  action 
defined  as 


= 


■n-a  >  A-"  * 


ne\iu,  if Kt 

“wait”,  otherwise, 


A  consequence  of  Lemma  4  is  that  an  equilibrium  strategy  profile  a  defines  both  a  time 
in  which  agent  i  acts  (immediately  after  communication  step  r”,<T),  but  also  the  number  of 
signals  that  agent  i  has  access  to  when  she  acts. 


6.2  Asymptotic  Learning 

We  begin  the  discussion  by  introducing  the  concepts  that  are  instrumental  for  asymp¬ 
totic  learning:  the  observation  radius  and  k-radius  sets.  Recall  that  an  equilibrium  of  the 
information  exchange  game  on  communication  network  Gn,  an  £  INFO(Gn),  induces  a 
timing  profile  Tn,<J,  such  that  agent  i  takes  an  irreversible  action  after  r'f°  communication 
steps.  We  call  the  observation  radius  of  agent  i  under  equilibrium  profile  on .  Note  that 
under  Assumption  5,  there  is  no  dependence  of  an  agent’s  expected  payoff  on  the  partic¬ 
ular  equilibrium  strategies,  that  players  follow.  Therefore,  when  Assumption  5  holds,  the 
observation  radius  of  agent  i  is  the  same  irrespective  of  the  strategy  profile  followed  by  the 
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rest  of  the  agents,  thus  we  may  drop  superscript  a.  We  refer  to  agent  Vs  observation  radius 
r”  under  Assumption  5  as  the  perfect  observation  radius.  In  the  appendix  we  present  an 
efficient  algorithm  based  on  dynamic  programming  for  computing  the  observation  radius 
of  each  agent  i  given  that  the  rest  of  the  agents  follow  strategy  profile  a™,. 

Given  the  notion  of  an  observation  radius,  we  define  k-radius  sets  (and  similarly  perfect 
k-radius  sets )  as  follows. 

Definition  4.  Let  Vff'a  be  defined  as 

Vf*  =  {ieAf\  \B^\<k}. 

We  refer  to  Vf  'a  as  the  k-radius  set. 

Intuitively,  Vf'a  includes  all  agents  that  take  an  action  before  they  receive  signals  from 
more  than  k  other  individuals  at  equilibrium  on  -  the  size  of  their  (indirect)  neighborhood 
by  the  time  they  take  an  irreversible  action  is  no  greater  than  k.  Equivalently,  agent  i 
belongs  to  set  Vf'a  if  the  number  of  agents  that  lie  at  distance  less  than  r'1’"  from  i  are  at 
most  k.  From  Definition  4  it  follows  immediately  that 

i  £  Vkn'a  =>-  i  £  V£'a  for  all  k'  >  k.  (6.1) 

The  following  proposition  provides  a  necessary  and  a  sufficient  condition  for  e,  d-asymptotic 
learning  to  occur  in  a  society  under  equilibrium  profile  a. 

Proposition  15.  Let  Assumption  6  hold.  Then, 

(a)  e,  5 -asymptotic  learning  occurs  in  society  {Gn}cfL1  under  equilibrium  profile  a  if 

lim  -  •  \Vk’a\  <  (  <  e,  (6.2) 

n-Aoo  Jl  K  V  7 

where  k  >  k  and  k  such  that  er f  I  , e  1  —  erf  { - ,  e  1  >  1  — 

\y/2pjkj  V  V&jiJ  1_C 
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(b)  e,  S- asymptotic  learning  does  not  occur  in  society  {Gn}“=1  under  equilibrium  profile 
a  if 

lim  -■\V^\>i1>e,  (6-3) 

n— >oo  n 

where  k  <k  and  k  such  that  erf  ^ ^/2p2/i)  ~~  er ^  y/zjp/tj  <  ^  ~  _  e/^' 

Intuitively,  asymptotic  learning  is  precluded  if  there  exists  a  significant  fraction  of  the 
society  that  will  take  an  action  before  seeing  a  large  set  of  signals,  since  in  this  case  there 
will  be  a  large  enough  probability  of  each  individual  taking  an  action,  that  is  far  from 
the  optimal  one,  since  it  is  based  on  a  small  set  of  signals.  In  the  rest  of  this  section, 
we  provide  a  series  of  corollaries  of  Proposition  15,  that  provide  more  intuition  on  the 
asymptotic  learning  result.  The  first  one  provides  a  necessary  and  sufficient  condition  for 
perfect  asymptotic  learning  to  occur  in  any  equilibrium  profile. 

Before  stating  the  proposition,  we  define  the  notion  of  leading  agents.  Let  indegf  = 
outdegf  =  \{j\i  £  BrfA}\  denote  the  in-degree,  out-degree  of  agent  i  in  communication 
network  Gn  respectively. 

Definition  5.  A  collection  {/S,}^=1  of  sets  of  agents  is  called  a  set  of  leading  agents  ( leaders ) 

if 

(i)  There  exists  k  >  0,  such  that  Snj  C  VjfJ  for  all  j  £  J,  where  J  is  an  infinite  index 
set. 

(n)  limn  )oc  L  •  |5n|  =  0,  i.e.,  the  collection  {S}^  contains  a  negligible  fraction  of  the 
agents  as  the  society  grows. 

(Hi)  lim^oo  i  •  |  S]ollow\  >  e,  for  some  e  >  0,  where  S%llow  denotes  the  set  of  followers  of 
Sn.  In  particular,  i  £  Sfollow  if  there  exists  j  £  Sn,  such  that  j  £  BfTn . 

Then, 

Proposition  16.  Let  Assumption  6  hold.  Then, 
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(i)  Perfect  asymptotic  learning  occurs  in  society  {Gn}™=1  in  any  equilibrium  a  if 

lim  lim  -  ■  =  0.  (6.4) 

/c— 4oo  n—¥oo  Tl 

(ii)  Conversely,  if  condition  (6.4)  does  not  hold  for  society  {Gn}™=1and  the  society  does 
not  contain  a  set  of  leaders,  then  perfect  asymptotic  learning  does  not  occur  in  any 
equilibrium  a. 

Proposition  16  is  not  stated  as  an  if  and  only  if  result  because  the  fact  that  condition 
(6.4)  does  not  hold  in  a  society,  does  not  necessarily  preclude  perfect  asymptotic  learning  in 
the  presence  of  leading  agents.  In  particular,  if  the  leaders  delay  their  irreversible  decision 
long  enough,  then  a  large  fraction  of  the  rest  of  the  agents  may  take  an  irreversible  action  as 
soon  as  they  communicate  with  the  leading  agents  and,  thus,  perfect  asymptotic  learning 
fails  (however  the  aggregate  welfare  is  higher).  However,  if  the  leading  agents  do  not 
coordinate  at  equilibrium,  then  they  exit  early  and  this  may  lead  the  rest  of  the  agents 
to  take  a  delayed,  but  more  informed,  irreversible  action.  Note  that  Proposition  16  is 
equilibrium  independent,  i.e..  it  holds  for  all  equilibria  and  Condition  (6.4)  is  expressed  in 
terms  of  the  network  topology. 

The  next  three  corollaries  identify  the  role  of  certain  types  of  agents  on  information 
spread  in  a  given  society.  We  focus  on  perfect  asymptotic  learning,  since  we  can  obtain 
sharper  results,  however,  we  can  state  similar  corollaries  for  e,  8- asymptotic  learning  for 
any  e  and  5.  All  corollaries  are  again  expressed  in  terms  of  the  original  network  topology. 

Similarly,  to  perfect  A;-radius  sets,  we  define  sets  Uf  for  scalar  k  >  0  as 

Uf  =  {i  £  Mn\  there  exists  i  £  B™Tn  with  indegf;  >  k}, 

i.e.,  the  set  Uf  consists  of  all  agents,  which  have  an  agent  with  in-degree  at  least  k  within 
their  perfect  observation  radius. 


76 


Corollary  2.  Let  Assumption  6  hold.  Then,  perfect  asymptotic  learning  occurs  in  society 


{G"}~=i  if 


lim  lim  —  •  \UJh\  —  1. 

k— >oo  n- 4oo  71 


Intuitively,  Corollary  2  states  that  if  all  but  a  negligible  fraction  of  the  agents  are 
at  a  short  distance  (at  most  their  perfect  observation  radius)  from  an  agent  with  a  high 
in-degree,  then  asymptotic  learning  occurs.  This  corollary  therefore  identifies  a  group  of 
agents,  that  is  crucial  for  a  society  to  permit  asymptotic  learning:  information  mavens, 
who  have  high  in-degrees  and  can  thus  act  as  effective  aggregators  of  information  (a  term 
inspired  by  [36]).  Information  mavens  are  one  type  of  hubs,  the  importance  of  which  is 
clearly  illustrated  by  our  learning  results.  Our  next  definition  formalizes  this  notion  further 
and  enables  an  alternative  sufficient  condition  for  asymptotic  learning. 


Definition  6.  Agent  i  is  called  an  information  maven  of  society  ifi  has  an  infinite 

in- degree,  i.e.,  if 

lim  indeqf  =  oo. 

n->oo 

Let  MAVEN {{Gn}ff=1)  denote  the  set  of  mavens  of  society  {Gn}^L1. 

For  any  agent  j,  let  6rvuv£'-v-'1  denote  the  shortest  distance  defined  in  communication 
network  Gn  between  j  and  a  maven  k  €  M  AV£N ( { Gn }  , ) .  Finally,  let  Wn  denote  the 

set  of  agents  that  are  at  distance  at  most  equal  to  their  perfect  observation  radius  from  a 
maven  in  communication  network  Gn,  i.e.,  Wn  =  {j |  d]vl'4v£'V"'ri  <  rf } . 

The  following  corollary  highlights  the  importance  of  information  mavens  for  asymptotic 
learning.  Informally,  it  states  that  if  almost  all  agents  have  a  short  path  to  a  maven,  then 
asymptotic  learning  occurs. 


Corollary  3.  Let  Assumption  6  hold.  Then,  asymptotic  learning  occurs  in  society  {Gn}Ni 
if 

lim  -  •  \  Wn\  =  1. 

n— )-oo  77, 
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Corollary  3  thus  clarifies  that  asymptotic  learning  is  obtained  when  there  are  informa¬ 
tion  mavens  and  almost  all  agents  are  at  a  “short  distance”  away  from  one  (less  than  their 
observation  radius).1 

As  mentioned  in  the  Introduction,  a  second  type  of  information  hubs  also  plays  an 
important  role  in  asymptotic  learning.  While  mavens  have  high  in-degree  and  are  thus 
able  to  effectively  aggregate  dispersed  information,  because  our  communication  network  is 
directed,  they  may  not  be  in  the  right  position  to  distribute  this  aggregated  information. 
If  so.  even  in  a  society  that  has  several  information  mavens,  a  large  fraction  of  the  agents 
may  not  benefit  from  their  information.  Social  connectors,  on  the  other  hand,  are  defined 
as  agents  with  a  high  out-degree,  and  thus  play  the  role  of  spreading  the  information 
aggregated  by  the  mavens.2  Before  stating  the  proposition,  we  define  social  connectors. 

Definition  7.  Agent  i  is  called  a  social  connector  of  society  {Gn}“=1  if  i  has  an  infinite 
out-degree,  i.e.,  if 

lim  out, deg f  =  oo. 

n-Aoo 

The  following  corollary  illustrates  the  role  of  social  connectors  for  asymptotic  learning. 

Corollary  4.  Let  Assumption  6  hold.  Consider  society  {Gn}f'_1,  which  is  such  that  the 
sequence  of  in-  and  out-  degrees  is  non- decreasing  for  every  agent  and  the  set  of  information 
mavens  does  not  grow  at  the  same  rate  as  the  society  itself,  i.e., 

lim  \mav£M({ct‘}z.1)\  =  0 

n—too  Ji 


Then,  for  asymptotic  learning  to  occur,  the  society  should  contain  a  social  connector  within 

xThis  corollary  is  weaker  than  Corollary  2.  This  is  simply  a  technicality  because  the  sequence  of 
communication  networks  {G"'}T=i  is  arbitrary.  In  particular,  we  have  not  assumed  that  the  in-degree  of 
an  agent  is  non-decreasing  with  n,  thus  the  limit  in  the  corollary  may  not  be  well  defined  for  arbitrary 
sequences  of  communication  networks 

2For  simplicity  (and  to  avoid  the  technical  issues  mentioned  in  a  previous  footnote)  we  assume  for  this 
corollary  that  both  the  in-  and  out-degree  sequences  of  agents  are  non-decreasing  with  n  (note  that  we  can 
rewrite  the  proposition  for  any  sequence  of  in-  and  out-  degrees  at  the  expense  of  introducing  additional 
notation). 
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a  short  distance  to  a  maven,  i.e., 


dMAV£Af,n  jQr  some  social  connector  i. 


Corollary  4  thus  states  that  unless  a  non-negligible  fraction  of  the  agents  belongs  to  the 
set  of  mavens  and,  subsequently,  the  rest  can  obtain  information  directly  from  a  maven, 
then,  information  aggregated  at  the  mavens  is  spread  through  the  out-links  of  a  connector 
(note  that  an  agent  can  be  both  a  maven  and  a  connector).  Combined  with  the  previous 
corollaries,  this  result  implies  that  there  are  essentially  two  ways  in  which  society  can 
achieve  perfect  asymptotic  learning.  First,  it  may  contain  several  information  mavens  who 
not  only  collect  and  aggregate  information  but  also  distribute  it  to  almost  all  the  agents 
in  the  society.  Second,  it  may  contain  a  sufficient  number  of  information  mavens,  who 
pass  their  information  to  social  connectors,  and  almost  all  the  agents  in  the  society  are 
a  short  distance  away  from  social  connectors  and  thus  obtain  accurate  information  from 
them.  This  latter  pattern  has  a  greater  plausibility  in  practice  than  one  in  which  the  same 
agents  will  collect  and  distribute  dispersed  information.  For  example,  if  a  website  or  a 
news  source  can  rely  on  information  mavens  (journalists,  researchers  or  analysts)  to  collect 
sufficient  information  and  then  reach  a  large  number  of  individuals,  then  information  will 
be  economically  aggregated  and  distributed  in  the  society. 

The  results  summarized  in  Propositions  15  and  16  as  well  as  in  Corollaries  2,  3  and  4 
can  be  seen  both  as  positive  and  negative,  as  already  noted  in  the  Introduction.  On  the  one 
hand,  communication  structures  that  do  not  feature  information  mavens  (or  connectors) 
will  not  lead  to  perfect  asymptotic  learning,  and  information  mavens  may  be  viewed  as 
unrealistic  or  extreme.  On  the  other  hand,  as  already  noted  above,  much  communication 
in  modern  societies  happens  through  agents  that  play  the  role  of  mavens  and  connectors  (see 
again  [36]).  These  are  highly  connected  agents  that  are  able  to  collect  and  distribute  crucial 
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information.  Perhaps  more  importantly,  most  individuals  obtain  some  of  their  information 
from  news  sources,  media,  and  websites,  which  exist  partly  or  primarily  for  the  purpose  of 
acting  as  information  mavens  and  connectors.3 

6.3  Strategic  Communication 

Next  we  explore  the  implication  of  relaxing  the  assumption  that  agents  cannot  manipu¬ 
late  the  messages  they  send,  i.e.,  that  information  on  private  signals  is  hard.  In  particular, 
we  replace  Assumption  6  with  Assumption  7  and  we  allow  agents  to  lie  about  their  or  any 
of  the  private  signals  they  have  obtained  information  about,  i.e.,  ,  •  •  ■  ,  ^ 

(fin,  •  •  •  ,  Sj(/„  ),  where  the  latter  is  the  true  vector  of  private  signals  of  agents  in  Ift.  In¬ 
formally,  an  agent  has  an  incentive  to  misreport  information,  so  as  to  delay  her  neighbors 
taking  irreversible  actions,  which  in  turn  prolongs  the  information  exchange  process. 

Let  ( an,mn )  denote  an  action-message  strategy  profile,  where  mn  =  {in")  -  ■  ■  ,  to™} 
and  mf  =  for  j  such  that  i  E  Bfx.  Also  let  subscript  ( an,mn )  refer  to  the 

probability  measure  induced  by  the  action-message  strategy  profile. 

Definition  8.  An  action-message  strategy  profile  (crn’*,mn’*)  is  a  pure-strategy  Perfect 
Bayesian  Equilibrium  of  the  information  exchange  game  Tinf0(Gn)  if  for  every  i  E  A fn  and 
time  t,  we  have 


E 


((Jn 


*)0; 


In  n,* 


{lit))  >  E. 


n,,))(7 


for  all  m"  and  cr" . 

As  before,  we  denote  the  set  of  equilibria  of  this  game  by  INFO(Gn,  Sn). 


Similarly  we  extend  the  definitions  of  asymptotic  learning  [cf.  Definitions  2  and  3].  We 
show  that  strategic  communication  does  not  harm  perfect  asymptotic  learning.  The  main 

3For  example,  a  news  website  such  as  cnn.com  acts  as  a  connector  that  spreads  the  information  aggre¬ 
gated  by  the  journalists-mavens  to  interested  readers.  Similarly,  a  movie  review  website,  e.g.,  imdb.com, 
spreads  the  aggregate  knowledge  of  movie  reviewers  to  interested  movie  aficionados. 
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intuition  behind  this  result  is  that  it  is  weakly  dominant  for  an  agent  to  report  her  private 
signal  truthfully  to  a  neighbor  with  a  high  in-degree  (maven),  as  long  as  others  are  truthful 
to  the  maven. 

Theorem  1.  If  perfect  asymptotic  learning  occurs  in  society  {Gn}^f=1  under  Assumption 
6,  then  there  exists  a  equilibrium  {<r,m),  such  that  perfect  asymptotic  learning  occurs  in 
society  {Gn}^=1  along  equilibrium  ( <j,m )  when  we  allow  strategic  communication  (cf.  under 
Assumption  7). 

This  theorem  therefore  implies  that  the  focus  on  truthful  reporting  was  without  much 
loss  of  generality  as  far  as  perfect  asymptotic  learning  is  concerned.  In  any  communication 
network  in  which  there  is  perfect  asymptotic  learning,  even  if  agents  can  strategically 
manipulate  information,  there  is  arbitrarily  little  benefit  in  doing  so.  Thus,  the  main  lessons 
about  asymptotic  learning  derived  above  apply  regardless  of  whether  communication  is 
strategic  or  not. 

However,  this  theorem  does  not  imply  that  all  learning  outcomes  are  identical  under 
truthful  and  strategic  communication.  In  particular,  interestingly,  we  can  construct  ex¬ 
amples  in  which  strategic  communication  leads  agents  take  a  better  action  with  higher 
probability  than  under  non-strategic  communication  (cf.  Assumption  6).  The  main  reason 
for  this  (counterintuitive)  fact  is  that  under  strategic  communication  an  agent  may  delay 
taking  an  action  compared  to  the  non-strategic  environment.  Therefore,  the  agent  obtains 
more  information  from  the  communication  network  and,  consequently,  chooses  an  action, 
that  is  closer  to  optimal. 

6.4  Welfare 

In  this  section,  we  turn  attention  to  the  question  of  efficient  communication  and  draw 
a  connection  with  our  previous  results  on  asymptotic  learning.  In  particular,  we  compare 
equilibrium  allocations,  i.e.,  communication  and  action  profiles  at  equilibrium,  with  a  set¬ 
ting  in  which  a  social  planner  dictates  the  timing  of  agents’  actions.  We  identify  conditions 
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under  which  a  social  planner  can  /  cannot  improve  in  terms  of  the  aggregate  expected  wel¬ 
fare  over  an  equilibrium  strategy  profile.  In  doing  so,  we  illustrate  an  interesting  feature  of 
social  networks:  communication  might  be  inefficient,  because  agents  do  not  internalize  the 
positive  externality  that  delaying  their  action  generates  for  their  peers. 

A  social  planner,  whose  objective  is  to  maximize  the  aggregate  expected  welfare  of  the 
population  of  n  agents,  would  choose  to  implement  the  timing  profile,  that  is  obtained 
as  a  solution  to  the  optimization  problem  below.  We  call  this  timing  profile  the  optimal 
allocation  and  we  denote  it  by  spn.  Note  that  spn  =  (t"  ,  •  ■  •  ,  t") 

n 

max^Esf[<]  (6.5) 

i= 1 

Similarly  with  the  asymptotic  analysis  for  equilibria,  we  define  a  sequence  of  optimal 
allocations  for  societies  of  growing  size,  sp  =  {spn}™=1.  We  are  interested  in  identifying 
conditions  under  which  the  social  planner  can  /  cannot  achieve  an  asymptotically  bet¬ 
ter  allocation  than  an  equilibrium  (sequence  of  equilibria)  <7,  i.e.,  we  are  looking  at  the 
expression: 

/AigJV"  [Ui  ]  ~  E^n 

n~4oo  fl 

First,  we  state  a  theorem  that  shows  a  direct  connection  between  learning  and  efficient 
communication . 

Theorem  2.  Consider  society  If  perfect  asymptotic  learning  occurs  at  the  optimal 

allocation  sp  =  {3pn}^h1?  then  all  equilibria  are  asymptotically  efficient,  i.e., 

lim  ]  ~  ICteJV"  _  q 

n-»oo  fl 


for  all  equilibria  a. 

Note  that  from  Proposition  16  it  follows  that  if  perfect  learning  occurs  at  the  optimal 
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allocation,  then  perfect  learning  occurs  in  all  equilibria  a. 


In  the  remainder  of  the  section,  we  provide  a  result  that  can  be  thought  of  as  a  partial 
converse  of  Theorem  2.  Before  stating  it,  we  contrast  the  single  agent  decision  problem 
with  that  of  the  social  planner.  In  particular,  consider  agent  i.  Then,  i  will  decide  to  take 
an  irreversible  action  at  time  t  and  not  wait  for  an  additional  dt,  when  other  agents  behave 
according  to  a,  if  (cf.  Section, 6.1): 


r  +  A 
r 


i /p2  + 1 /p2K? 


>  ViiKf  +  I B, 


n,cr 

i,\Tt\+l 


I  un<a 


W) 


(6.6) 


Similarly,  in  the  corresponding  optimal  allocation  agent  i  will  exit  at  time  t  and  not  wait 
if: 


r  +  A 


7 r 


i /p‘  + 

>  WST  +  IBJffi+,1  -  IB3SMP)  +  E%KI*  “waits"  “exits”  •»<!• 

(6.7) 


Contrasting  (6.6)  with  (6.7)  shows  the  reason  for  why  equilibria  are  inefficient  in  this  setting: 
when  determining  when  to  act  agent  i  does  not  take  into  account  the  positive  externality 
that  a  later  action  exerts  on  others,  which  is  expressed  by  the  summation  on  the  right  hand 
side  of  (6.7).  We  are  left  with  showing  sufficient  conditions  under  which  a  social  planner 
outperforms  an  equilibrium  allocation  a.  Consider  agents  i  and  j  such  that  i  G  and 
Tn'a  >  +  1,  which  implies  that  Bnr„.„  D  J3nT„,CT  (i.e.,  agent  j  communicates  with  a 

superset  of  the  agents  that  i  communicates  with  before  taking  an  action).  Also,  let 
denote  the  additional  agents  that  j  would  observe  if  i  delayed  her  irreversible  action  by  dt 
and  communication  took  place.  Then,  the  aggregate  welfare  of  the  two  agents  increases  if 
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the  following  condition  holds: 


m. 


n,a 

n.cr 


+  k  : 


n,a 


l3,Ti 


0  +  Vi(k^n,a 


+  hn:\,„ 

V,T  i 


)  >  Vj(k% (6.8) 


A 


In  particular,  let  <  k  (and  in  consequence  kn,an,a  <  k ).  Then,  (6.8)  holds  if  A:,  < 

A:”’(h.CT  <  A:2  (fci,  k2  constants),  where  the  lower  bound  guarantees  that  the  aggregate  welfare 
increases  by  the  additional  delay,  whereas  the  upper  bound  ensures  that  agent  i  would  find 
it  optimal  to  “exit”  after  communication  step  rfa  at  equilibrium  a. 

Now  let  set  DfJ  denote  the  following  set  of  agents:  j  e  D^’J, ,  if  <  k  and  there 

exists  an  i  6  such  that  if  i  exits  at  r/’cr  +  1,  then  j  gains  access  to  at  least  an 

additional  |  signals.  Then,  the  following  proposition  provides  a  sufficient  condition  for  an 
equilibrium  to  be  inefficient. 

Proposition  17.  Consider  society  {Gn}% °=1  and  equilibrium  a  =  {a71}^.  Assume  that 


n-f  oo  ?7, 


>  £  >  0, 


for  k,  £  that  satisfy  the  following: 


1/p2  +  1/p2 (k  +  X)<\2+  i)  l/p2  +  l/p2 k  +  X7" 
Then,  there  exists  an  f  >  0,  such  that 

[Uj  ]  —  EigiV"  ^ 

n— >oc  fi 


i.e.,  equilibrium  a  is  asymptotically  inefficient.  Moreover,  there  exist  e,5  such  that  e,6- 
asymptotic  learning  fails  at  equilibrium  a. 
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Chapter  7 


Asymptotic  Learning  in  Random 
Graphs 


As  an  illustration  of  the  results  we  outlined  in  Section  6,  we  apply  them  to  a  series  of 
commonly  studied  random  graph  models.  We  begin  by  providing  the  definitions  for  the 
graph  models  we  focus  on.  Note  that  in  the  present  section  we  assume  that  communication 
networks  are  bidirectional,  or  equivalently  that  if  agent  i  G  Bfy  then  j  G  Bfx . 

Definition  9.  A  sequence  of  communication  networks  {Gn}%Lt,  where  Gn  =  {A"'1,  £nj,  is 
called 

(i)  complete  if  for  every  n  we  have 

(i,j)  £  £n  for  all  i,j  G  Nn  ■ 

(ii)  k-bounded  degree  for  scalar  k  >  0,  if  for  every  n  we  have 

|  Bfy  |  <  k  for  all  i  G  Mn  > 

where  recall  that  denotes  the  agents  that  are  one  link  away  from  agent  i  in  com- 
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munication  network  Gn. 


(in)  a  star  if  for  every  n  we  have 


(i,  1)  G  £n  and  (■ i,j )  ^  £n  for  all  i  G  J\fn  and  j  1  ■ 


Definition  10  (Erdos- Renyi).  A  sequence  of  communication  networks  {Gn}fL1,  where 
Gn  =  {J\fn,£n},  is  called  Erdos- Renyi  if  for  every  n  we  have 

P  £  £n)  =  —  independently  for  all  i,j  G  J\fn  , 
n 

where  p  scalar,  such  that  0  <  p  <  1. 


Definition  11  (Power-Law).  A  sequence  of  communication  networks  {Gn}fLl ,  where  G"  = 
{ffn,  £n },  is  called  Power- Law  with  exponent  7  >  0  if  we  have 


lim 

n—>oo 


SieiV'1 


n 


=  Cfc  •  k  1  for  every  scalar  k  >  0, 


where  Cf.  is  a  constant.  In  other  words ,  the  fraction  of  nodes  in  the  network  having  degree 
k,  for  every  k  >  0,  follows  a  power  law  distribution  with  exponent  7. 

Definition  12  (Preferential  Attachment).  A  sequence  of  communication  networks  {Gn}f(=l, 
where  Gn  —  {A fn,£n},  is  called  preferential  attachment  if  it  was  generated  by  the  following 
process: 

(i)  Begin  the  process  with  Gl  —  {{1},  {(1, 1)},  i.e.,  the  communication  network  that 
contains  agent  1  and  a  loop  edge. 

(ii)  At  step  n,  add  agent  n  +  1  to  Gn .  Choose  an  agent  w  from  Gn  and  let  £n+1  = 
£n  +  (n+l,w).  Agent  w  is  chosen  according  to  the  preferential  attachment  rule,  i.e., 
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w  =  j  for  j  E  Nn  with  probability 


P(w  =  j ) 


degjj) 
deg{£)  ’ 


w/iere  deg(j)  denotes  the  degree  of  node  j  at  the  step. 

Definition  13  (Hierarchical).  A  sequence  of  communication  networks  {Gn}fL1;  where 
Gn  =  {J\fn,£n},  is  called  (- hierarchical  (or  simply  hierarchical)  if  it  was  generated  by 
the  following  process: 

(i)  Agents  are  born  and  placed  into  layers.  In  particular,  at  each  step  n  =  1,  ■  •  •  ,  a  new 
agent  is  born  and  placed  in  layer  I. 

(ii)  Layer  index  I  is  initialized  to  1  (i.e.,  the  first  node  belongs  to  layer  1).  A  new  layer 
is  created  ( and  subsequently  the  layer  index  increases  by  one)  at  time  period  n  >  2 
with  probability  -prty,  where  £  >  0. 

(in)  Finally,  for  every  n  we  have 

P  ((i,j)  G  £n )  =  j^nj,  independently  for  all  i,j  G  J\fn  that  belong  to  the  same  layer  £, 


where  jVf  denotes  the  set  of  agents  that  belong  to  layer  £  at  step  n  and  p  scalar,  such 
that  0  <  p  <  1 .  Moreover, 


F((i,k)e£n)  = 


1 

I  Af<e 


and  ^  ^  P  ((i,  k )  G  £n )  =  1  for  all  i  G  Nf,  k  G  £  >  1, 

k^Nci 


where  denotes  the  set  of  agents  that  belong  to  a  layer  with  index  lower  than  £  at 
step  n. 


Intuitively,  a  hierarchical  sequence  of  communication  networks  resembles  a  pyramid, 
where  the  top  contains  only  a  few  agents  and  as  we  move  towards  the  base,  the  number 
of  agents  grows.  The  following  argument  provides  an  interpretation  of  the  model.  Agents 
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(a)  Hierarchical  Society.  (b)  Complete  Society.  (c)  Star  Society. 

Figure  7-1:  Example  society  structures. 


on  top  layers  can  be  thought  of  as  “special”  nodes,  that  the  rest  of  the  nodes  have  a  high 
incentive  connecting  to.  Moreover,  agents  tend  to  connect  to  other  agents  in  the  same  layer, 
as  they  share  common  features  with  them  ( homophily ).  As  a  concrete  example,  academia 
can  be  thought  of  as  such  a  pyramid,  where  the  top  layer  includes  the  few  institutions, 
then  next  layer  includes  academic  departments,  research  labs  and  finally  at  the  lower  levels 
reside  the  home  pages  of  professors  and  students. 

Proposition  18.  Let  Assumptions  6  hold  and  consider  society  {G"}^!!  and  discount  rate 
r  >  0.  Then, 

(i)  Perfect  asymptotic  learning  does  not  occur  in  society  {G"}^  if  the  sequence  of  com¬ 
munication  networks  {G”}^L1  is  k-bounded,  for  some  constant  k  >  0. 

(ii)  For  every  e  >  0,  asymptotic  learning  does  not  occur  in  society  {G"}^^  with  probability 
at  least  1  —  e,  if  the  sequence  of  communication  networks  {G71}^  is  preferential 
attachment. 

(Hi)  For  every  e  >  0,  asymptotic  learning  does  not  occur  in  society  {G71}^  with  probability 
at  least  1  —  e,  if  the  sequence  of  communication  networks  {Gn}^=1  is  Erdos-Renyi. 

Proposition  19.  Let  assumptions  6  hold  and  consider  society  {G71}^.  Then, 

(i)  Asymptotic  learning  occurs  in  society  {Gn}(£=1  if  the  sequence  of  communication  net¬ 
works  {G"}^  is  complete  and  the  discount  rate  r  is  smaller  than  some  scalar  r\  >  0. 
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(ii)  Asymptotic  learning  occurs  in  society  if  the  sequence  of  communication  net 

works  {G"}^  is  a  star  and  the  discount  rate  r  is  smaller  than  some  scalar  r2  >  0. 

(in)  Let  e  >  0.  Then,  asymptotic  learning  occurs  in  society  {Gn}^L1  with  probability  at 
least  1  -  e,  if  the  sequence  of  communication  networks  {Gn})f=1  is  7- power  law,  with 
7  <  2  and  the  discount  rate  r  is  smaller  than  some  scalar  r 3  >  0. 

(iv)  Let  e  >  0.  Then,  asymptotic  learning  occurs  in  society  {GT,}^‘L1  with  probability  at 
least  1  -  e,  if  the  sequence  of  communication  networks  {G71}^  is  ((e) -hierarchical 
and  the  discount  rate  r  is  smaller  than  some  scalar  r 4  >  0. 

The  results  presented  provide  additional  insights  on  the  conditions  under  which  asymp¬ 
totic  learning  takes  place.  The  popular  preferential  attachment  and  Erdos-Renyi  graphs 
do  not  lead  to  asymptotic  learning,  which  can  be  interpreted  as  implying  that  asymptotic 
learning  is  unlikely  in  several  important  networks.  Nevertheless,  these  network  structures, 
though  often  used  in  practice,  do  not  provide  a  good  description  of  the  structure  of  many 
real  life  networks.  In  contrast,  our  results  also  showed  that  asymptotic  learning  takes  place 
in  power  law  graphs  with  small  exponent  7  <  2,  and  such  graphs  appear  to  provide  a 
better  representation  for  many  networks  related  to  communication,  including  for  peer-to- 
peer  networks,  scientific  collaboration  networks  (in  experimental  physics)  and  traffic  in 
networks  ([50],  [57],  [69],  [70]).  Asymptotic  learning  also  takes  place  in  hierarchical  graphs, 
where  “special”  agents  are  likely  to  receive  and  distribute  information  to  lower  layers  of 
the  hierarchy. 
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Chapter  8 


Network  Formation 

In  previous  chapters,  we  studied  information  exchange  among  agents  over  a  given  com¬ 
munication  network  Gn  =  (Afn,Sn).  In  the  present  chapter,  we  turn  attention  to  the 
complementary  problem  of  how  this  communication  network  emerges.  We  assume  that 
link  formation  is  costly.  In  particular,  communication  costs  are  captured  by  annxn  non¬ 
negative  matrix  Cn,  where  C fj  denotes  the  cost  that  agent  i  has  to  incur  in  order  to  form 
the  directed  link  (j,  i)  with  agent  j.  As  noted  previously,  a  link’s  direction  coincides  with 
the  direction  of  the  flow  of  messages.  In  particular,  agent  i  incurs  a  cost  to  form  in-links. 
We  refer  to  Cn  as  the  communication  cost  matrix.  We  assume  that  C’f  =  0  for  all  i  G  Mn. 

We  define  agent  i’s  link  formation  strategy ,  g" ,  as  an  n-tuple  such  that  gf  E  {0,  l}n  and 
g'f  =  1  implies  that  agent  i  forms  a  link  with  agent  j.  The  cost  agent  i  has  to  incur  if  she 
implements  strategy  g"  is  given  by 

cost  W)  =  £cy95- 

jeM 

The  link  formation  strategy  profile  gn  =  (g\l.  ■  ■  ■  ,  .<?" )  induces  the  communication  network 
Gn  =  ( J\fn ,  £”),  where  ( j,i )  G  En  if  and  only  if  gf3  =  1. 

We  extend  the  previously  described  environment  to  the  two-stage  Network  Learning 
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Game  T(Cn),  where  Cn  denotes  the  communication  cost  matrix.  The  two  stages  of  the 
network  learning  game  can  be  described  as  follows: 

Stage  1  [Network  Formation  Game]:  Agents  pick  their  link  formation  strategies.  The 
link  formation  strategy  profile  gn  induces  the  communication  network  Gn  =  (A fn,En). 

We  refer  to  stage  1  of  the  network  learning  game,  when  the  communication  cost  matrix  is 
Cn  as  the  network  formation  game  and  we  denote  it  by  Tnet(Cn). 

Stage  2  [Information  Exchange  Game]:  Agents  communicate  over  the  induced  net¬ 
work  Gn  as  studied  in  previous  chapters. 

We  next  define  the  equilibria  of  the  network  learning  game  T(Cn).  Note  that  we  use 
the  standard  notation  and  <j_j  to  denote  the  strategies  of  agents  other  than  i.  Also,  we 
let  <Tj  _t  denote  the  vector  of  actions  of  agent  i  at  all  times  except  t. 

Definition  14.  A  pair  ( gn’*,an •*)  is  a  pure-strategy  Perfect  Bayesian  Equilibrium  of  the 
network  learning  game  r((7n)  if 

(a)  an’*  E  INFO(Gn),  where  Gn  is  induced  by  the  link  formation  strategy  gn'*. 

(b)  For  all  i  £  J\fn ,  gf*  maxtmizes  the  expected  payoff  of  agent  i  given  the  strategies  of 
other  agents  gf* ,  i.e., 

gf*  E  arg  max  £*[11 i{z,gf*)]  =  E*«| If0)  -  Cost(z). 

2e{o,i}"  ’ 

for  all  a  E  INFO{Gn),  where  Gn  is  induced  by  link  formation  strategy  (z.  gf*). 

We  denote  the  set  of  equilibria  of  this  game  by  NET(Cn). 

8.1  Learning  in  endogenous  networks 

Similar  to  the  analysis  of  the  information  exchange  game,  we  consider  a  sequence  of 
communication  cost  matrices  {Cn}rff=l,  where  for  fixed  n, 

Cn  :  J\fn  x  J\fn  -4  and  Off  =  C"+i  for  all  i,  j  E  Nn.  (8.1) 
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For  the  remainder  of  the  section,  we  focus  our  attention  to  the  social  cliques  commu¬ 
nication  cost  structure.  The  properties  of  this  communication  structure  are  stated  in  the 
next  assumption. 

Assumption  8.  Let  c”-  G  {0,  c}  for  all  pairs  (i,j)  G  J\fn  x  J\fn ,  where  c  <  1/p2  ■ 

Moreover,  let  Cij  =  Cji  for  all  i,j  G  Afn  (symmetry),  and  Cij  +  cjk  >  cik  for  all  i.  j.  k  G  Afn 
(triangular  inequality) . 

The  assumption  that  c  <  x/  rules  out  the  degenerate  case  where  no  agent  forms  a 

costly  link.  The  symmetry  and  triangular  inequality  assumptions  are  imposed  to  simplify 
the  definition  of  a  social  clique,  which  is  introduced  next.  Let  Assumption  8  hold.  We 
define  a  social  clique  (cf.  Figure  8-1)  Hn  C  Afn  as  a  set  of  agents  such  that 

i,j  G  Hn  if  and  only  if  ctj  =  =  0. 

Note  that  this  set  is  well-defined  since,  by  the  triangular  inequality  and  symmetry  assump¬ 
tions,  if  an  agent  i  does  not  belong  to  social  clique  Hn,  then  ctJ  =  c  for  all  j  G  Hn. 
Hence,  we  can  uniquely  partition  the  set  of  nodes  A fn  into  a  set  of  Kn  pairwise  dis¬ 
joint  social  cliques  Wl  =  {//{',  •  -  <  ,  H’f-n } .  We  use  the  notation  Unk  to  denote  the  set 
of  pairwise  disjoint  social  cliques  that  have  cardinality  greater  than  or  equal  to  k,  i.e., 
'H™  =  [Hf,  i  —  1, . . . ,  Kn  |  | if" |  >  k}.  We  also  use  scn(i)  to  denote  the  social  clique  that 
agent  i  belongs  to. 

We  consider  a  sequence  of  communication  cost  matrices  {Cn}(f=1  satisfying  condition 
(8.1)  and  Assumption  8,  and  we  refer  to  this  sequence  as  a  communication  cost  structure. 
As  shown  above,  the  communication  cost  structure  uniquely  defines  the  following 

sequences,  {'Hn}%=1  and  for  k  >  0,  of  sets  of  pairwise  disjoint  social  cliques. 

Moreover,  it  induces  network  equilibria  (g,cr)  —  (gn,an)^=1  such  that  (gn,  <rn)  G  NET(Cn) 
for  all  n.  Our  goal  is  to  identify  conditions  on  the  communication  cost  structure,  that  lead 
to  the  emergence  of  networks,  that  guarantee  asymptotic  learning.  We  focus  entirely  on 
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Social  clique  1  Social  clique  2 


Figure  8-1:  Social  cliques. 


perfect  asymptotic  learning,  as  this  enables  us  to  obtain  sharp  results. 

Theorem  3.  Let  {Cn}^=1  be  a  communication  cost  structure  and  let  Assumptions  6  and 
8  hold.  Then,  there  exists  a  constant  k  =  k{c)  such  that  the  following  hold: 

(a)  Suppose  that 

\m\ 

limsup J — —  >  e  for  some  e  >  0.  (8.2) 

n— >oo  n 


Then,  perfect  asymptotic  learning  does  not  occur  in  any  network  equilibrium  (g,cr). 

(b)  Suppose  that 

\njn  | 

lim  — —  =  0  and  lim  \Hf\  =  oo  for  some  £.  (8.3) 

n—¥  oo  n  n—too  11 


Then,  perfect  asymptotic  learning  occurs  in  all  network  equilibria  ( g ,  a)  when  the 
discount  rate  r  satisfies  0  <  r  <  f ,  where  f  >  0  is  a  constant. 

(c)  Suppose  that 

\yn\ 

lim  - — —  =  0  and  lim  sup  I  Hf\  <  M  for  all  £,  (8.4) 

n->oo  n  n->  oo  11 

where  M  >  0  is  a  scalar,  and  let  agents  be  patient,  i.e.,  consider  the  case,  when  the 
discount  rate  r  — >•  0.  Then,  there  exists  a  c  >  0  such  that 
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:  Clicme  with  size  >  k  •  :  Individual  Agent 


(a)  Equilibrium  network,  when  (8.3)  holds.  (b)  Equilibrium  network,  when  (8.4)  holds. 

Figure  8-2:  Network  formation  among  social  cliques. 

(i)  If  c  <  c,  perfect  asymptotic  learning  occurs  in  all  network  equilibria  (<?,cr). 

(ii)  If  c>  c,  perfect  asymptotic  learning  depends  on  the  network  equilibrium  consid¬ 
ered. 

In  particular,  there  exists  at  least  one  network  equilibrium  ( g,a ),  where  there  is 
no  perfect  asymptotic  learning  and  there  exists  at  least  one  network  equilibrium 
( g ,  a)  where  perfect  asymptotic  learning  occurs. 


The  results  in  this  theorem  provide  a  fairly  complete  characterization  of  what  types  of 
environments  will  lead  to  the  formation  of  networks  that  will  subsequently  induce  perfect 
asymptotic  learning.  The  key  concept  is  that  of  a  social  clique,  which  represents  groups 
of  individuals  that  are  linked  to  each  other  at  zero  cost.  These  can  be  thought  of  as 
“friendship  networks,”  which  are  linked  for  reasons  unrelated  to  information  exchange  and 
thus  can  act  as  conduits  of  such  exchange  at  low  cost.  Agents  can  exchange  information 
without  incurring  any  costs  (beyond  the  delay  necessary  for  obtaining  information)  within 
their  social  cliques.  However,  if  they  wish  to  obtain  further  information,  from  outside 
their  social  cliques,  they  have  to  pay  a  cost  at  the  beginning  in  order  to  form  a  link. 
Even  though  network  formation  games  have  several  equilibria,  the  structure  of  our  network 
formation  and  information  exchange  game  enables  us  to  obtain  relatively  sharp  results  on 
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what  types  of  societies  will  lead  to  endogenously  formed  communication  networks  that 
ensure  perfect  asymptotic  learning.  In  particular,  the  first  part  of  Theorem  3  shows  that 
perfect  asymptotic  learning  cannot  occur  in  any  equilibrium  if  the  number  of  sufficiently 
large  social  cliques  increases  at  the  same  rate  as  the  size  of  the  society  (or  its  lim  sup  does 
so).  This  is  intuitive;  when  this  is  the  case,  there  will  be  many  social  cliques  of  sufficiently 
large  size  that  none  of  their  members  wish  to  engage  in  further  costly  communication  with 
members  of  other  social  cliques.  But  since  several  of  these  will  not  contain  an  information 
hub  social  learning  is  precluded. 

In  contrast,  the  second  part  of  the  theorem  shows  that  if  the  number  of  disjoint  and 
sufficiently  large  social  cliques  is  limited  (grows  less  rapidly  than  the  size  of  the  society) 
and  some  of  them  are  large  enough  to  contain  information  hubs,  then  perfect  asymptotic 
learning  will  take  place  (provided  that  future  is  not  heavily  discounted).  In  this  case,  as 
shown  by  Figure  8-2(a),  sufficiently  many  social  cliques  will  connect  to  the  larger  social 
cliques  acting  as  information  hubs,  ensuring  effective  aggregation  of  information  for  the 
great  majority  of  the  agents  in  the  society.  It  is  important  that  the  discount  factor  is  not 
too  small,  otherwise  smaller  cliques  will  not  find  it  beneficial  to  form  links  with  the  larger 
cliques. 

Finally,  the  third  part  of  the  theorem  outlines  a  more  interesting  configuration,  poten¬ 
tially  leading  to  perfect  asymptotic  learning.  In  this  case,  many  small  social  cliques  form 
an  “informational  ring”  (Figure  8-2(b))  .  Each  is  small  enough  that  it  finds  it  beneficial 
to  connect  to  another  social  clique,  provided  that  this  other  clique  will  also  connect  to 
others  and  obtain  further  information.  This  intuition  also  clarifies  why  such  information 
aggregation  takes  place  only  in  some  equilibria.  The  expectation  that  others  will  not  form 
the  requisite  links  leads  to  a  coordination  failure.  Interestingly,  however,  if  agents  are  suf¬ 
ficiently  patient  and  the  cost  of  link  formation  is  not  too  large,  the  coordination  failure 
equilibrium  disappears,  because  it  becomes  beneficial  for  each  clique  to  form  links  with 
another  one,  even  if  further  links  are  not  forthcoming. 
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Chapter  9 


Conclusion 

This  thesis  studies  optimal  stopping  problems,  when  agents  interact  with  each  other  in  a 
strategic  fashion.  We  describe  our  framework  in  the  context  of  two  economic  applications: 
(1)  experimentation  and  innovation  in  a  competitive  multi-firm  R&D  market  and  (2)  infor¬ 
mation  exchange  among  individuals,  that  are  embedded  in  a  social  network  structure.  The 
key  feature  of  our  framework  is  a  tradeoff  that  economic  agents  are  facing  when  choosing 
when  to  act.  Observing  other  agents’  actions  (and  their  outcomes)  may  increase  one’s 
expected  payoff  either  directly  (experimentation  model)  or  indirectly  (through  information 
spillovers).  Thus,  there  is  an  incentive  to  delay.  However,  future  is  discounted,  therrefore 
agents  prefer  earlier  than  later  actions.  We  conclude  by  summarizing  the  results  we  obtain 
in  both  models.  We  also  discuss  a  number  of  interesting  avenues  for  future  research. 

9.1  Experimentation,  patents  and  innovation 

In  the  first  part  of  the  thesis,  we  present  a  model  of  experimentation  and  innovation 
in  a  multi-firm  environment.  In  the  baseline  version  of  the  model,  each  firm  receives  a 
private  signal  on  the  success  probability  of  one  of  many  potential  research  projects  and 
decides  when  and  which  project  to  implement.  A  successful  innovation  can  be  copied  by 
other  firms,  thus  reducing  the  profits  of  the  original  innovator.  Firms  have  an  incentive 
to  delay  their  experimentation  decision  (investment  in  R&D)  and  free-ride  on  the  efforts 
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of  their  competitors.  Naturally,  symmetric  equilibria,  where  actions  do  not  depend  on 
the  identity  of  the  firm,  necessarily  involve  delayed  and  staggered  experimentation.  When 
the  signal  quality  is  the  same  for  all  players,  the  equilibrium  is  in  mixed  strategies  (pure¬ 
st  rat  egy  symmetric  equilibria  do  not  exist).  When  signal  quality  differs  across  firms,  the 
equilibrium  is  represented  by  a  function  r  (p)  which  specifies  the  time  at  which  a  firm 
with  signal  quality  p  experiments.  As  in  the  environment  with  common  signal  quality,  the 
equilibrium  may  involve  arbitrarily  long  delays. 

We  also  show  that  the  social  cost  of  insufficient  experimentation  incentives  can  be 
arbitrarily  large.  The  optimal  allocation  may  require  simultaneous  rather  than  staggered 
experimentation.  In  this  case,  the  efficiency  gap  between  the  optimal  allocation  and  the 
equilibrium  can  be  arbitrarily  large.  Instead,  when  the  optimal  allocation  also  calls  for 
staggered  experimentation,  the  equilibrium  is  inefficient  because  of  delays.  We  show  that 
in  this  case  the  ratio  of  social  surplus  in  the  equilibrium  to  that  in  the  optimal  allocation 
can  be  as  low  as  1/2. 

One  of  the  main  arguments  we  make  is  that  appropriately-designed  patent  systems  en¬ 
courage  experimentation  and  reduce  delays  without  preventing  efficient  ex  post  transfer 
of  knowledge  across  firms.  Consequently,  when  signal  quality  is  the  same  for  all  firms, 
an  appropriately-designed  patent  system  can  ensure  that  the  optimal  allocation  results 
in  all  equilibria.  Patents  are  particularly  well-suited  to  providing  the  correct  incentives 
when  the  optimal  allocation  also  requires  staggered  experimentation.  In  this  case,  patents 
can  simultaneously  encourage  one  of  the  firms  to  play  the  role  of  a  leader  in  experimen¬ 
tation,  while  providing  incentives  to  others  to  copy  successful  innovations.  Technically, 
appropriately-designed  patents  destroy  symmetric  equilibria,  which  are  the  natural  equi¬ 
libria  in  the  absence  of  patents  but  may  involve  a  high  degree  of  inefficiency.  That  patents 
are  an  attractive  instrument  in  this  environment  can  also  be  seen  from  our  result  that, 
while  patents  can  implement  the  optimal  allocation,  there  exists  no  simple  subsidy  (to  ex¬ 
perimentation,  research,  or  innovation)  that  can  achieve  the  same  policy  objective.  When 
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signal  quality  differs  across  firms,  patents  are  again  useful  in  encouraging  experimentation 
and  reduce  delays,  however  typically  they  are  unable  to  ensure  the  optimal  allocation. 

We  believe  that  the  role  of  patents  in  encouraging  socially  beneficial  experimentation 
is  more  general  than  the  simple  model  used  in  this  thesis.  In  particular,  throughout  our 
analysis  we  only  briefly  consider  the  consumer  side.  It  is  possible  that  new  innovations 
create  benefits  to  consumers  that  are  disproportionately  greater  than  the  use  of  existing 
successful  innovations  (as  compared  to  the  relative  profitabilities  of  the  same  activities). 
In  this  case,  the  social  benefits  of  experimentation  are  even  greater  and  patents  can  also 
be  useful  in  preventing  copying  of  previous  successful  innovations.  The  investigation  of 
the  welfare  and  policy  consequences  of  pursuing  successful  lines  versus  experimenting  with 
new,  untried  research  lines  is  an  interesting  and  underresearched  area. 

9.2  Information  exchange  in  endogenous  social  net¬ 
works 

In  the  second  part  of  the  thesis,  we  develop  a  model  of  information  exchange  through 
communication  in  social  networks  and  investigate  its  implications  for  information  aggre¬ 
gation  in  large  societies.  An  underlying  state  (of  the  world)  determines  which  action  has 
higher  payoff.  Agents  decide  which  agents  to  form  a  communication  link  with  incurring 
the  associated  cost  and  receive  a  private  signal  correlated  with  an  underlying  state.  They 
then  exchange  information  over  their  communication  network  until  taking  an  (irreversible) 
action. 

Our  focus  has  been  on  asymptotic  learning,  defined  as  the  fraction  of  agents  taking  a 
nearly  optimal  action  converging  to  one  in  probability  as  a  society  grows  large.  We  show 
that  perfect  asymptotic  learning  occurs  if  and,  under  some  additional  mild  assumptions, 
only  if  the  communication  network  includes  information  hubs  and  most  agents  are  at  a  short 
distance  from  a  hub.  Thus  asymptotic  learning  requires  information  to  be  aggregated  in 
the  hands  of  a  few  agents.  This  kind  of  aggregation  also  requires  truthful  communication, 
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which  we  show  is  an  equilibrium  of  the  strategic  communication  in  large  societies.  We 
also  explore  the  welfare  implications  of  equilibrium  behavior  by  comparing  the  aggregate 
expected  welfare  at  equilibrium  with  that  at  the  optimal  allocation  (when  a  social  planner 
dictates  the  timing  of  actions). 

In  the  third  part,  we  extend  the  information  exchange  model  by  allowing  endogenous 
network  formation.  Using  our  analysis  of  information  exchange  over  a  given  network,  we 
provide  a  systematic  investigation  of  what  types  of  cost  structures,  and  associated  so¬ 
cial  cliques  which  consist  of  groups  of  individuals  linked  to  each  other  at  zero  cost  (such  as 
friendship  networks),  ensure  the  emergence  of  communication  networks  that  lead  to  asymp¬ 
totic  learning.  Our  main  result  on  network  formation  shows  that  societies  with  too  many 
(disjoint)  and  sufficiently  large  social  cliques  do  not  form  communication  networks  that 
lead  to  asymptotic  learning,  because  each  social  clique  would  have  sufficient  information  to 
make  communication  with  others  not  sufficiently  attractive.  Asymptotic  learning  results 
if  social  cliques  are  neither  too  numerous  nor  too  large  so  as  to  encourage  communication 
across  cliques. 

Interesting  avenues  for  research  include  investigation  of  similar  dynamic  models  of  infor¬ 
mation  exchange  and  network  formation  in  the  presence  of  ex  ante  or  ex  post  heterogeneity 
of  preferences  as  well  as  differences  in  the  quality  of  information  available  to  different  agents, 
which  may  naturally  lead  to  the  emergence  of  hubs.  Furthermore,  the  present  model  con¬ 
siders  only  purely  informational  externalities  among  the  agents.  It  is  conceivable,  though, 
that  one’s  action  directly  affects  the  expected  payoffs  of  her  peers.  Assuming  that  agents’ 
incentives  are  misaligned  leads  to  the  following  natural  question:  under  what  conditions  can 
truthful  communication  be  sustained?  Alternatively,  one  can  think  of  introducing  payoff 
externalities  in  our  model  as  extending  cheap-talk  models  by  considering  a  multi-period, 
dynamic  environment  (as  opposed  to  the  static,  one-shot  models,  that  are  prevalent  in  the 
literature) . 
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Appendix  A 


Omitted  Proofs  from  Chapters  1-4 

Proof  of  Lemma  1 

The  proof  comprises  three  steps.  First,  we  show  that  t  =  0  belongs  to  the  support  of  mixing 
time  (so  that  there  is  no  time  interval  with  zero  probability  of  experimentation).  Suppose, 
to  obtain  a  contradiction,  that  ty  =  inf {t:  A (t)  >  0}  >  0.  Then,  because  experimenting 
after  tx  is  in  the  support  of  the  mixed-strategy  equilibrium,  equilibrium  payoffs  must  satisfy 


Vi  =  e~rtlpU2. 

Now  consider  deviation  where  firm  i  chooses  A  (0)  =  Too.  This  has  payoff 

Vo  =  pIL  >  V\ 


for  any  ty  >  0,  yielding  a  contradiction. 

Second,  we  show  that  there  does  not  exist  T  <  oo  such  that  the  support  of  the  stopping 
time  r  (induced  by  A)  is  within  [0,T].  Suppose  not,  then  it  implies  that  there  exists 
t  €  [0,T]  such  that  \{t)  =  +oo  and  let  U  =  inf{t:  A (t)  =  Too}.  This  implies  that  the 
payoff  to  both  firms  once  the  game  reaches  time  t-y  without  experimentation  (which  has 
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positive  probability  since  ti  =  inf {t:  A (t)  —  Too})  is 

V  (t  =  ti)  =  e~rtlpTi2 

(where  V(r  =  t),  or  V  (t),  denotes  present  discounted  value  as  a  function  of  experimenta¬ 
tion  time).  Now  consider  a  deviation  by  firm  i  to  strategy  r',  which  involves  waiting  for 
e  >  0  after  the  game  has  reached  tx  and  copying  a  successful  project  by  firm  ~  i  (if  there 
is  such  a  success).  This  has  payoff 

V(rl)  =  e^t+^\pU2  +  (l~p)pU1} 

since  firm  ~  i  is  still  A(£i)  =  +oo  and  will  thus  experiment  with  probability  1  at  t\. 
Assumption  1  implies  that  V  ( r ')  is  strictly  greater  than  V  {t  =  t\)  for  e  sufficiently  small. 

Finally,  we  show  that  A  (t)  >0  for  all  t.  Again  suppose,  to  obtain  a  contradiction, 
that  there  exist  tx  and  t2  >  h  such  that  A (t)  #  0  for  t  G  (ti,t2).  Then,  with  the  same 
argument  as  in  the  first  part,  the  payoff  from  the  candidate  equilibrium  strategy  r  to  firm 
i  conditional  on  no  experimentation  until  t\  is 

V  (r)  =  e-rt2pU2. 

However,  deviating  and  choosing  r'  =  t\  yields 

V  (t*  =  ti)  =  e~rtlpU2  >  V  (r) . 

This  contradiction  completes  the  proof  of  the  lemma. 

Proof  of  Proposition  12 

The  proof  consists  of  two  main  steps.  The  first  involves  characterizing  the  equilibrium 
with  two  firms  when  p  has  support  [a,  b]  C  [0,/?].  The  second  involves  extending  the 
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characterization  of  equilibrium  to  the  more  general  case  when  the  support  of  G  is  [a,  b)  C 
[0,1]. 

Step  1:  We  show  that  under  the  assumption  that  [a,  b]  C  [0,/S],  there  exists  a  symmetric 
equilibrium  represented  by  a  strictly  decreasing  function  r{p)  with  ~r (6)  =  0  which  maps 
signals  to  time  of  experimentation  provided  that  the  other  player  has  not  yet  experimented. 
Proposition  20  formalizes  this  idea  and  is  proved  by  using  a  series  of  lemmas. 

Proposition  20.  Suppose  that  the  support  of  G  is  [a,  b]  C  [0,  f3\ .  Define 

if  fb  1 

T(p)  =  —  log G(b)  (1  -  b)  -  log G(p)  (1  -p)+  /  log G(z)dz  .  (A.l) 

T  fj  J  p 

Then  the  unique  symmetric  equilibrium  takes  the  following  form: 

1.  each  firm  copies  a  successful  innovation  and  immediately  experiments  if  the  other  firm 
experiments  unsuccessfully; 

2.  firm  i  with  signal  quality  pi  experiments  at  time  r  (pfi  given  by  (A.l)  unless  firm  ~  i 
has  experimented  before  time  r{pi). 

Proof.  The  proof  uses  the  following  lemmas. 

Lemma  5.  r{p)  cannot  be  locally  constant.  That  is,  there  exists  no  interval  P  =  [p,p  +  e\ 
with  e  >  0  such  that  r(p )  =  t  for  all  p  G  P. 

Proof.  Suppose,  to  obtain  a  contradiction,  that  the  equilibrium  involves  r(p)  =  t  for  all 
p  e  P.  Then,  let  pt  G  P.  Firm  i's  (time  t)  payoff  after  the  game  has  reached  (without 
experimentation)  time  t  is 

v  (t  |  p^  =  pi  [(G  (p  +  e)  -  G  (p))  III  +  (1  -  G  {p  +  e)  +  G  (p))  n2] , 

since  with  probability  G  (p  +  e)  —  G  (p)  firm  ~  i  has  p  G  P  and  thus  also  experiments 
at  time  t.  In  this  case,  firm  i,  when  successful,  is  not  copied  and  receives  III.  With 
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the  complementary  probability,  it  is  copied  and  receives  II2.  Now  consider  the  deviation 
T(Pi)  =  t  +  5  for  6  >  0  and  arbitrarily  small.  The  payoff  to  this  is 

vd  (t  |  Pi)  =  e~r5  [{G  (p  +  e)  -  G  (p))  ((II2  +  (1  -  C)pJIi)  +  (1  -  G  (p  +  e)  +  G  (Pl))  PlU2] , 

where  C  =  E  [p  \  p  G  P]  is  the  expected  probability  of  success  of  a  firm  with  type  in  the 
set  P .  Since  If2  >  pjl\,  we  have  £fl2  +  (1  —  C) P*rii  >  PiUi-  Moreover,  by  the  assumption 
that  G  has  strictly  positive  density,  G  (p  +  e)  -  G  (p)  >  0.  Thus  for  5  sufficiently  small,  the 
deviation  is  profitable.  This  contradiction  establishes  the  lemma.  ■ 

Lemma  6.  r(p)  is  continuous  in  [ a,b ]. 

Proof.  Suppose  r  (p)  is  discontinuous  at  p.  Assume  without  loss  of  generality  that  r  (p+)  = 
hnpp)  t  (p  +  c)  >  r  (p-)  =  lim^o  r  (p  +  e).  Then  firms  with  signal  p  =  p  +  8  for  sufficiently 
small  S  >  0  can  experiment  at  time  r  (p-)  +  e  for  e  <  r  (p+)  -  r  (p~)  and  increase  their 
payoff  since  r  >  0.  ■ 

Lemma  7.  r(p)  is  strictly  monotone  on  [a,  6] . 

Proof.  Suppose,  to  obtain  a  contradiction,  that  there  exist  qx  >  q2  such  that  r(q1)  = 
t(^2)  =  t.  Suppose  that  ~  i  follows  the  ecpiilibrium  strategy  characterized  by  r(p)  and 
consider  firm  i’s  expected  profit  when  Pi  =  q  and  it  chooses  to  experiment  at  time  t.  This 
can  be  written  as 

V(q,  t)=  f  e~rT(p)  (Pn2  +  (1  -p)  9n0  dG(p)  +  e~rtqU2  [  dG(p ),  (A.2) 

JPePbefore  -> P^after 

where  P^/ore  =  { P:  t(p)  <  t}  and  P after  =  {p:  t(p)  >  *}•  Notice  that  V(q,t)  is  linear  in  q. 

For  r(p)  to  characterize  a  symmetric  equilibrium  strategy  and  given  our  assumption 
that  r(qi)  =  r(q2)  =  r,  we  have 

v  (9i»^)  >  V  (qi,  t')  and  V  (q2,f)  >  V  ( q2,t ')  (A.3) 
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for  all  t'  G  M+. 

Now  take  q  =  aqx  +  (1  -  a)q2  for  some  a  G  (0, 1).  By  the  linearity  of  V  ( q,t ),  this 
implies  that  for  any  1  ^  f,  we  have 

V'(agi  I  (1  fv)(/2,  /.)  =  afo  (qi,t)  +  (1  -  a)  V  (q2,t) 

<  aV(ql,T{q1))  +  {l-a)V(q2,T(q2)) 

=  V{aqi  +  (l-a)q2,T), 

where  the  middle  inequality  exploits  (A. 3).  This  string  of  inequalities  implies  that 

r  (a.qi  +  (1  -  a)q2)  =  f  for  a  G  [0, 1]. 

Therefore,  r  must  be  constant  between  q\  and  q2.  But  this  contradicts  Lemma  5,  estab¬ 
lishing  the  current  lemma.  ■ 

The  three  lemmas  together  establish  that  r  is  continuous  and  strictly  monotone.  This 
implies  that  r  is  invertible,  with  inverse  r"1  (f).  Moreover,  r  (b)  =  0,  since  otherwise  a 
firm  with  signal  b  —  e  could  experiment  earlier  and  increase  its  payoff.  Now  consider  the 
maximization  problem  of  firm  i  with  signal  q.  This  can  be  written  as  an  optimization 
problem  where  the  firm  in  question  chooses  the  threshold  signal  p  =  r^1  (£)  rather  than 
choosing  the  time  of  experimentation  t.  In  particular,  this  maximization  problem  can  be 
written  as 

max  f  e-rr(p~i)  +  (1  -  p^)qlli)  dG  (p^)  +  e~rT(p)G  (p)  qll2,  (A. 4) 

pe[a,b]  Jv 

where  the  first  term  is  the  expected  return  when  the  firm  ~  i  has  signal  quality  p^i  G  [p,  b] 
and  the  second  term  is  the  expected  return  when  p^i  <  p,  so  that  firm  ~  i  will  necessarily 
copy  from  i’s  successful  innovation. 

Next,  suppose  that  r  is  differentiable  (we  will  show  below  that  r  must  be  differentiable). 
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Then  the  objective  function  (A. 4)  is  also  differentiable  and  the  first-order  optimality  con¬ 
dition  can  be  written  (after  a  slight  rearrangement)  as 


rr'(p) 


9  ip ) 

G(P ) 


i  -^-(i-p)r1 


In  a  symmetric  equilibrium,  the  function  r  ( p )  must  be  a  best  response  to  itself,  which 
here  corresponds  to  p  =  q.  Therefore,  when  differentiable,  r  (p)  is  a  solution  to 


rT\p)  =  1- 


(A. 5) 


Integrating  this  expression,  then  using  integration  by  parts  and  the  boundary  condition 
r(b)  =  0,  we  obtain  the  unique  solution  (when  r  (p)  is  differentiable)  as 


r{p)  = 


J_  f\,  _  ,  9  (z) 

rpjp{  )C(/Z 


log G(6)  (1  -  b)  -logG(p)  (1  -p)  4-  J  log G(z)dz 


To  complete  the  proof,  we  need  to  establish  that  this  is  the  unique  solution.  Lemmas 
6  and  7  imply  that  r  (p)  must  be  continuous  and  strictly  monotone.  The  result  follows  if 
we  prove  that  r  ( p )  is  also  differentiable.  Recall  that  a  monotone  function  is  differentiable 
almost  everywhere,  i.e.,  it -can  can  have  at  most  a  countable  number  of  points  of  non¬ 
differentiability  (see,  for  example,  [28],  p.  101,  Theorem  3.23).  Take  p  to  be  a  point  of 
non-differentiability.  Then  there  exists  some  sufficiently  small  e  >  0  such  that  r  (p)  is 
differentiable  on  (p  —  e,p)  and  on  (p,p  +  e).  Then  (A. 5)  holds  on  both  of  these  intervals. 
Integrating  it  over  these  intervals,  we  obtain 


r(p) 
r  ip) 


1  fP  q  (z) 

T  ~  e)  -  rp  J-  G\z)dZ  f°r  Pe(P~e’P)’  and 

1  fP  q  (z) 

T  ip)  -  ^  J  (1  -  z)  Qjzjdz  for  P  e  (P’P  +  e)  • 
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Now  taking  the  limit  e  ->  0  on  both  intervals,  we  have  either  (i)  r  (p+)  ±  r(p-);  or 
(ii)  r  (p+)  =  t  (p—).  The  first  of  these  two  possibilities  contradicts  continuity,  so  (ii)  must 
apply.  But  then  r  (p)  is  given  by  (A.l)  and  is  thus  differentiable.  This  argument  establishes 
that  r  (p)  is  differentiable  everywhere  and  proves  the  uniqueness  of  equilibrium.  ■ 

Step  2:  To  complete  the  proof  of  Proposition  12  we  need  to  consider  the  more  general 
case  when  the  support  of  G  is  [a,  b }  C  [0, 1].  This  consists  of  the  showing  three  additional 
claims.  First,  we  show  that  firms  with  p  >  ft  will  always  experiment  before  firms  with 
p  e  [ a,/3 ).  The  claim  follows  by  a  single-crossing  argument.  First,  recall  that  the  value  of 
experimenting  at  time  t  for  a  firm  with  p  E  [a,  j3)  is  given  by  (A. 2).  Defining  Ptajler?\a+  =  {P: 
r(p)  >  t  and  p  >  /?}  and  P*/ter =  {p:  r(p)  >  t  and  p  <  /?},  the  value  of  experimenting 
for  a  firm  with  q  E  [a,  /?)  can  be  rewritten  as 


V(q,t)  = 


<?n2 


ipeP, 


before 


r(v)  (P  +  Lll 
\q  3 


dG(p)  +  e 


—rt 


*pePt 


after  A(3+ 


dG(p)+  /  dG(p) 

•'P^Pafte  r/\p- 

(A.6) 


which  exploits  the  fact  that  when  p  E  Plepyn:  or  when  p  E  Pu/terAa+ ,  there  will  be  no 
copying,  and  when  p  E  Ptafte.rf,s_ ,  the  innovation  (which  takes  place  again  with  probability 
q)  will  be  copied,  for  a  payoff  of  n2  =  /3 III . 

Next,  turning  to  firms  with  p  —  q'  >  /3,  recall  that  these  firms  prefer  not  to  copy  prior 
successful  experimentation  (from  Lemma  3).  Therefore,  their  corresponding  value  can  be 
written  as 


V(q',  t)  =  «'n2  I  i  f  e-'3r>dG(p)  +  e-rt  ±  /  dG(p)  +  f 

l  P  JpePLjor*  lP  Jp€P'-  /terA/3-F  J  P^KfterAP- 


dG(p) 


(A.l) 


Note  also  that  when  the  experimentation  time  is  reduced,  say  from  t  to  t'  <  t.  the 
first  integral  gives  us  the  cost  of  such  a  change  and  the  second  expression  ( e~rt  times  the 
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square  bracketed  term)  gives  the  gain.  Now  the  comparison  of  (A. 6)  to  (A. 7)  establishes 
the  single-crossing  property,  meaning  that  at  any  t  a  reduction  to  t'  <  t  is  always  strictly 
more  valuable  for  q'  >  8  than  for  q  £  [a,  (3).  First,  the  gains,  given  by  the  expression  in 
(A. 6)  and  (A. 7)  are  identical.  Second,  the  term  in  parenthesis  in  the  first  integral  in  (A. 6) 
is  a  convex  combination  of  1/q  >  1//3  and  1  /(3,  and  thus  is  strictly  greater  than  1//3,  so 
that  the  cost  is  always  strictly  greater  for  q  £  [a,  f3)  than  for  q'  >  8.  From  this  strict 
single-crossing  argument  it  follows  that  there  exists  some  T  such  that  r  (p)  <  T  for  all 
p  >  f3  and  r  (p)  >  T  for  all  p  £  [a,  j3). 


The  second  claim  establishes  that  all  firms  with  p  >  (3  will  experiment  immediately,  that 
is,  t  (p)  =  0  for  all  p>  (3.  To  show  this,  first  note  that  all  terms  in  (A. 7)  are  multiplied  by 
q'  >  /3,  so  the  optimal  set  of  solutions  for  any  firm  with  p>  8  must  be  identical.  Moreover, 
since  r(p)  >  T  for  all  p  £  [a,  (3),  PlfterNi_  is  identical  for  all  t  £  [0,T],  and  t  >  0  is 
costly  because  r  >  0.  Therefore,  the  unique  optimal  strategy  for  all  p  >  j3  is  to  experiment 
immediately.  Therefore,  r  (p)  =  0  for  all  p  >  (3. 


Finally,  we  combine  the  equilibrium  behavior  of  firms  with  p  >  f3  with  those  oip  £  [a,  8). 
First,  suppose  that  b  <  (3.  Then  the  characterization  in  Proposition  20  applies  exactly. 
Next  suppose  that  b>  (3,  so  that  some  firms  might  have  signals  p  >  (3.  The  previous  step 
of  the  proof  has  established  that  these  firms  will  experiment  immediately.  Subsequently, 
firms  with  p  £  [a,  (3)  will  copy  a  successful  innovation  at  time  t  =  0  or  experiment  if  there 
is  an  unsuccessful  experimentation  at  t  =  0.  If  there  is  no  experimentation  at  t  =  0,  then 
equilibrium  behavior  (of  firms  with  p  £  [ a,/? ))  is  given  by  Proposition  20  except  that  the 
upper  support  is  now  j3  and  the  relevant  distribution  is  G  (p)  conditional  on  p  £  [a,  (3),  thus 
all  terms  are  divided  by  G  ((3).  This  completes  the  proof  of  Proposition  12. 
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Proof  of  Proposition  13 

The  proof  mimics  that  of  Proposition  12.  with  the  only  difference  that  the  maximization 
problem  of  firm  i,  with  signal  p,  =  q ,  is  now  modified  from  (A. 4)  to 


max  T  e~rT(P~b  (p^  (n2  -  r,)  +  (1  -  p~i)  qlh)  dG  (p^)  +  err(p)G  (p)  q  (n2  +  p)  , 

p£[aM  Jp 


which  takes  into  account  that  copying  has  cost  rj  and  if  firm  i  is  the  first  innovator,  then 
it  will  be  copied  and  will  receive  p.  Repeating  the  same  argument  as  in  Proposition  12 
establishes  that  the  unique  equilibrium  is  given  by  (4.5). 

To  prove  the  second  part  of  the  proposition,  first  suppose  that  6  <  pv  and  b  <  pn'  so 
that  6  =  b  in  both  cases.  Recall  also  that  r7'  (p)  =  f 77  (p)  >  0  for  p  E  [a,pv).  pr>  is  decreasing 
in  p,  so  that  r77  (p)  =  0  implies  that  r77'  (p)  =  0  for  any  rf  >  p.  We  therefore  only  need 
to  show  that  fr'(p)  is  strictly  decreasing  for  all  p  G  [a,;/7).  Since  f 77  (p)  is  differentiable, 
it  is  sufficient  to  show  that  its  derivative  with  respect  to  p  is  negative.  This  follows  by 
differentiating  (4.5)  (with  6  =  6).  In  particular, 


drTI(p)  _  1 

dr]  (n2  +  p) 


fV  ^  +  ^°S  °  ^  ~  bS  G  ^ 


<0, 


since  log  G{b)  >  log  G{p)  and  r77  (p)  >  0. 

Next,  suppose  that  6  >  p77'.  In  that  case  6  =  p77'  and  drT1(p)/dr]  (in  the  neighborhood  of 
rf)  will  include  additional  terms  because  of  the  effect  of  r/  on  6.  In  particular: 


drv'  (p) 
dp 


wrv)  r 00 + dry  (logG  M  ■ logG  <p)) 

gfo’O  n! -2p/-p77'ni  gjpv')  , 

G'(p77')n1  r(n2  +  p7)G  (p77')  G(p’7')n1  ^ ' 


(A.8) 


The  first  line  is  again  strictly  negative  and  so  is  the  first  expression  in  the  second  line.  The 
second  expression  in  the  second  line  could  be  positive,  however.  For  given  ?/,  this  term 
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is  decreasing  in  p  and  tends  to  0  as  p  approaches  p11'  (from  (4.5)).  Therefore,  there  exists 
p*  ( r f)  such  that  for  p  >  p*  (?/),  it  is  no  larger  than  the  first  term  in  the  second  line.  This 
establishes  that  for  p  E  [p*  (?/) , p'1 ),  rn  (p)  is  again  decreasing  in  77,  completing  the  proof. 
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Appendix  B 


Equilibrium  with  Private  Signals  and 
Multiple  Firms 


We  now  extend  the  analysis  of  the  private  and  heterogeneous  signals  environment  from 
Section  4  to  the  case  with  multiple  firms.  In  particular,  we  show  that  all  equilibria  are 
payoff  equivalent  for  all  players  and  involve  firms  with  strong  signals  ( Pi  >  ft )  experiment 
first.  We  also  discuss  briefly  the  structure  of  those  equilibria  and  provide  an  explicit 
characterization  of  the  unique  mixed  equilibrium,  when  all  firms  with  signals  >  ft  use  the 
same  experimentation  strategy. 

Proposition  21.  Suppose  that  there  are  N  >  3  firms  and  the  support  of  G  satisfies  [a,  b]  (ft 
[0, /?].  Then: 

1.  There  does  not  exist  a  symmetric  equilibrium  in  which  all  firms  withp  >  ft  experiment 
at  t  =  0. 

2.  All  symmetric  equilibria  involve  firms  with  p  >  ft  experimenting  in  the  time  interval 
[0,  T]  and  the  rest  of  the  firms  experiment  after  T  (if  there  is  no  prior  experimentation) 
for  some  T  >  0. 

3.  All  symmetric  equilibria  take  the  following  form: 
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( a)  Firms  with  p  >  {3  experiment  in  time  interval  [0,  T]  with  flow  rate  of  experimen¬ 
tation  f(p,  t). 

(b)  If  there  is  not  any  prior  experimentation,  a  firm  with  signal  pt  <  (3  experiments 
at  time  r(pi)  >  T,  where  r(.)  is  a  strictly  decreasing  function. 

Moreover,  all  such  equilibria  are  payoff  equivalent  for  all  players. 


Note  that  the  characterization  of  Part  3  allows  for  pure  strategies  from  firms  with  strong 
signals  (p  >  33)  -  in  fact  there  is  such  an  equilibrium. 

Proof.  (Part  1)  Suppose  that  N  =  3,  and  that  U2  =  n3.  Suppose,  to  obtain  a  contradic¬ 
tion,  that  there  exists  a  symmetric  equilibrium  where  all  firms  with  p  >  (3  experiment  at 
t  =  0.  Consider  firm  i  with  p7  >  f3.  Let  y0  be  the  probability  that  none  of  the  other  two 
firms  have  p  >  (3,  Xi  be  the  probability  that  one  of  the  other  two  firms  has  p  >  f3  and  y  2 
be  the  probability  that  both  firms  have  p  >  3.  Let  us  also  define  (  =  E  [p  |  p  >  (3).  Since 
Pi  >  (3 ,  by  hypothesis,  firm  i  experiments  at  time  t  —  0.  Its  expected  payoff  is 


V  (pi,  0)  =  XoPi^  +  XiPi 


+  (i-C)n2 


+  X2Pi  ni- 


Intuitively,  when  none  of  the  other  two  firms  have  p  >  (3,  when  successful,  the  firm  is  copied 
immediately,  receiving  payoff  n2.  When  both  of  the  other  two  firms  have  p  >  3,  there  is 
no  copying,  so  when  successful,  firm  i  receives  Lb.  When  one  of  the  other  two  firms  has 
p>3,  then  this  other  firm  also  experiments  at  time  t  =  0  and  is  successful  with  probability 
C  =  E  [p  |  p  >  0] .  In  that  case,  in  a  symmetric  equilibrium  the  third  firm  copies  each  one  of 
the  two  successful  innovations  with  probability  1/2.  With  the  complementary  probability, 
1  —  C?  the  other  firm  with  p  >  [3  is  unsuccessful,  and  the  third  firm  necessarily  copies  firm 
i. 

Now  consider  the  deviation  to  wait  a  short  interval  e  >  0  before  innovation.  This  will 
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have  payoff 


lim  V  ( Pi ,  e)  =  XoPi^-2  +  Xi Pi  (Cni  +  (1  -  C)  H2)  +  X2 

e|0 

>  v(Pl,0). 

The  first  line  of  the  previous  expression  follows  since,  with  this  deviation,  when  there  is 
one  other  firm  with  p  >  d,  the  third  firm  necessarily  will  copy  the  first  innovator.  The 
inequality  follows  since  IT  >  n2,  establishing  that  there  cannot  be  an  equilibrium  in  which 
all  firms  with  p  >  /3  experiment  at  time  t  =  0.  This  argument  generalizes,  with  a  little 
modification,  to  cases  in  which  N  >  3  and  IIns  differ. 

(Part  2)  Part  2  follows  from  a  single  crossing  argument  similar  to  the  one  used  in  the 
proof  of  Proposition  12. 

(Part  3  -  Sketch)  We  give  an  argument  for  why  all  equilibria  are  payoff  equivalent 
for  all  players.  Consider  firms  with  strong  signals,  i.e.,  firms  such  that  p  >  ft.  Note  that 
i t(p,t)  >  0  for  at  least  some  p  for  all  t  6  [0,  T].  It  is  evident  now  from  (A. 7)  in  the  proof  of 
Proposition  20  that  all  firms  with  signal  p  >  j3  have  to  be  indifferent  between  experimenting 
at  any  time  t  in  [0,  T]  and,  in  particular,  between  experimenting  at  t  and  at  0.  Combined 
with  Part  1,  this  implies  that  the  expected  payoff  of  a  firm  with  signal  p  >  (3  is  equal  to 

pHiWipi  >  f3,  for  all  i)  +  -pIT  (1  —  P(p*  >  P,  for  all  i)) , 

in  all  equilibria  (no  matter  what  the  equilibrium  strategy  profile  is).  Similarly,  we  can  show 
that  the  expected  payoff  for  the  rest  of  the  firms  (firms  with  weak  signals)  is  also  the  same 
in  all  equilibria.  ■ 

In  Proposition  21  we  did  not  explicitly  describe  any  equilibrium.  Proposition  22  pro¬ 
vides  a  characterization  of  the  unique  mixed  strategy  equilibrium,  when  firms  with  signals 
p  >  3  use  the  same  experimentation  strategy,  i.e.,  the  rate  of  experimentation  for  those 
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firms  depends  only  on  time  t  (not  their  signal  quality),  £(p,t)  =  £(t).  To  simplify  the 
exposition,  we  focus  on  an  economy  in  which  N  —  3  and  G  is  uniform  on  [0, 1],  The  char¬ 
acterization  result  in  this  proposition  can  be  (in  a  relatively  straightforward  way)  extended 
to  N  >  3.  We  also  conjecture  that  it  can  be  extended  to  any  distribution  G,  though  this 
is  less  trivial. 


Proposition  22.  Consider  an  economy  with  N  =  3  firms,  If2  =  n3  and  G  uniform  over 
[0, 1],  Then,  the  following  characterizes  the  unique  symmetric  equilibrium,  when  firms  with 
signals  p  >  /3  use  the  same  experimentation  strategy. 

•  Firms  with  p  >  fi  experiment  at  the  flow  rate  f(t)  as  long  as  no  other  firm  has 
experimented  until  t.  They  experiment  immediately  following  another  ( successful  or 
unsuccessful)  experiment.  There  exists  T  <  oo  such  that 

e~  Jo  €(t)dt  _  q 

That  is,  all  firms  with  p  >  (3  will  have  necessarily  experimented  within  the  interval 
[0,  T]  (or  equivalently,  lim t^Tf(t)  =  -fooj. 

•  Firms  with  p  <  (3  immediately  copy  a  successful  innovation  and  experiment  at  time 
t 2  (p)  following  an  unsuccessful  experimentation  and  at  time  f3  (p)  if  there  has  been 
no  experimentation  until  time  T. 

Proof.  Let  us  define  p{t)  as  the  probability  that  firm  ~  i  that  has  not  experimented  until 
time  t  has  p^t  >  ft.  The  assumption  that  G  is  uniform  over  [0, 1]  implies  that  //(())  =  1  —  ft. 

Now  consider  the  problem  of  firm  i  with  Pi  >  (3.  If  there  has  yet  been  no  experimentation 
and  this  firm  experiments  at  time  t,  its  payoff  (discounted  to  time  t  =  0)  is 

v  (Pi,  t )  =  Pie~rt [II1/r(t)2  +  n2(l  -  p(t)2)\, 
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since  p  (f)2  is  the  probability  with  which  both  other  firms  have  p  >  f3  and  will  thus  not 
copy.  With  the  complementary  probability,  its  innovation  will  be  copied.  Alternately,  if  it 
delays  experimentation  by  some  small  amount  dt  >  0,  then  its  payoff  is: 


V  ( Pi,t  +  dt)  = 


p.e-dt+dt) 


2p£(t)p(t)dtUi 


+(1  -  2((t)p(t)dt)[Uip(t  +  dt)2  +  n2(l  -  pit  +  dt)2)} 


+2£(t)p(t)(l  -  p)dt[Uip(t  +  dt)  +  n2(l  -  p{t  +  dt))}  , 


where  p  =  E[p\p  >  6}  and  we  use  the  fact  that  other  firms  with  p  >  /3  experiment  at 
the  rate  £( t ).  In  a  mixed-strategy  equilibrium,  these  two  expressions  must  be  equal  (as 
dt  — >  0).  Setting  these  equal  and  rearranging,  we  obtain 

+  2«t)M«)(l  -  Mt))[p  +  m  -  r  At)  =  (B.l) 


In  addition,  the  evolution  of  beliefs  p  (t)  given  the  uniform  distribution  and  flow  rate  of 
experimentation  at  £  ( t )  can  be  obtained  as 


p{t) 


e-/0‘eWdr(i 


(B.2) 


Now  let  us  define 

f(t)  =  e-/o  «dd(r)(1_^)i 

Using  (B.3),  (B.2)  can  be  rewritten  as 


which  in  turn  implies 


p(t)  = 


m 

f{t)  +  P' 


m  = 


PKt) 

1  -  p{t) ' 


(B.3) 
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Moreover  (B.3)  also  implies  that 


m 


m 

fit) 


(i  -  fi{t))n{ty 


(B.4) 


Substituting  these  into  (B.l),  we  obtain  the  following  differential  equation  for  the  evolution 
of 

2 (*)  +  1  -  Kt))\p  +  v(t)}  ~  rfj?(t)  =  pL.l_x- 

Further  substituting  £(t)  from  (B.4),  we  obtain 

^(t)  =  ^ptr1  - 1) (1  +  (/3~'  ~  4  f‘1(t))' 

This  differential  equation  satisfies  the  Lipschitz  condition  and  therefore  it  has  a  unique 
solution,  which  takes  the  form 


V(t)  =  r—-r  tan 

Vr1  - 1 


arctan 


1) 


~t  + 


Vp  1  - 1 


with  boundary  condition  // ( 0 )  =  1-/3.  Given  this  solution,  the  flow  rate  of  experimentation 
for  firms  with  p  >  ft,  £  (t),  is  obtained  from  (B.4)  as 


€(t)  =  cic2(l  +  tan(c!(-c2i  +  c3))2) 


+ 


-ci  +  tan(ci(-c2t  +  c3))  tan(ci(-c2f  +  c3))J 


where 


Cl 


arctan 


i) 


and  c3  = 


vF 


It  can  then  be  verified  that 


lim  £  (t)  —  oo, 

t—>T 
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where  T  =  c3/c2.  It  can  also  be  verified  that  for  all  t  €  [0,  T],  where  firms  with  p  >  (3 
are  experimenting  at  positive  flow  rates,  firms  with  p  <  (3  strictly  prefer  to  wait.  The 
equilibrium  behavior  of  these  firms  after  an  unsuccessful  experimentation  or  after  time  T  is 
reached  is  given  by  an  analysis  analogous  to  Proposition  12.  Combining  these  observations 
gives  the  form  of  the  equilibrium  described  in  the  proposition.  ■ 
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Appendix  C 


Experimentation  in  Discrete  Time 


We  discuss  a  discrete  time  version  of  the  model  described  in  the  main  text.  We  formally 
show  that  the  continuous  time  model,  which  is  our  main  focus,  provides  the  same  economic 
and  mathematical  answers  as  the  limit  of  the  discrete-time  model,  when  the  length  of  the 
time  interval  A  -»  0.  We  limit  the  discussion  to  the  case  of  two  symmetric  firms. 

Environment 

Let  us  denote  the  time  interval  between  two  consecutive  periods  by  A  >  0.  In  what 
follows  we  will  take  A  to  be  small.  During  an  interval  of  length  A,  the  payoff  to  a  firm  that 
is  the  only  one  implementing  a  successful  project  is  7TiA  >  0.  In  contrast,  if  a  successful 
project  is  implemented  by  both  firms,  each  receives  7r2  A  >  0.  The  payoff  to  an  unsuccessful 
project  is  normalized  to  zero.  Both  firms  discount  the  future  at  the  common  rate  r  >  0  (so 
that  the  discount  factor  per  period  is  e~rA). 

Strategies  are  defined  in  this  game  as  follows.  Let  a  history  up  to  time  t  (where  t  —  kA 
for  some  integer  k )  be  denoted  by  hl .  The  set  of  histories  is  denoted  by  'Ht .  A  strategy 
for  a  firm  is  a  mapping  from  the  history  of  the  game  up  to  time  t,  hl ,  to  the  probability  of 
experimentation  at  a  given  time  interval  and  the  set  of  projects.  Thus  the  time  t  strategy 
can  be  written  as 

a*  -.H*  -t  [0,1]  x  {1,2}, 
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where  [0, 1]  denotes  the  probability  of  implementing  a  project  (either  experimenting  or 
copying)  at  time  t. 

Equilibria 

We  start  with  asymmetric  equilibria.  In  an  asymmetric  equilibrium,  one  of  the  firms,  say 
1,  immediately  experiments  with  its  project.  Firm  2  copies  firm  1  in  the  next  time  period  if 
the  latter  is  successful  and  tries  its  own  project  otherwise.  In  terms  of  the  notation  above, 
this  asymmetric  equilibrium  would  involve 

(OHM), 


for  t  =  0,  A,  2A, ....  In  words,  this  means  that  firm  1  chooses  to  experiment  immediately 
(if  it  has  not  experimented  yet  until  t )  and  experiments  with  its  project.  Firm  2,  on  the 
other  hand,  uses  the  strategy 


(1,1) 


(aV)  =  <  (1,2) 


if  a1  —  1  and  s*  =  1, 
if  a*  =  1  and  s*  m  0, 


[  (0,  •)  if  a1  =  0, 


for  t  =  0,  A,  2A, ....  The  crucial  feature  highlighted  by  these  strategies  is  that  firm  2  never 
experiments  until  firm  1  does. 

Using  the  same  analysis  as  in  the  proof  of  Proposition  2  below,  it  is  straightforward  to 
verify  that  <r2  is  a  best  response  to  ax  provided  that  A  <  A*  =  r_1  log  (/3  +  1  -  p).  What 
about  di?  Given  d2,  suppose  that  the  game  has  reached  time  t  (where  t  =  kA  for  k  G  N). 
If  firm  1  now  follows  oq,  it  will  receive  expected  payoff 


V[t]  =  p  (e  rt7Ti  A  +  e  r(t+A)n2)  , 

at  time  t,  since  its  experimentation  will  be  successful  with  probability  p,  yielding  a  profit  of 
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7TiA  during  the  first  period  following  the  success  (equivalent  to  e~rt7rxA  when  discounted 
to  time  t  =  0).  Then  according  to  <r2,  firm  2  will  copy  the  successful  project  and  firm  1 
will  receive  the  present  discounted  value  e-r(M-A)n2  from  then  on.  If,  at  this  point,  firm 
1  chooses  not  to  experiment,  then  the  game  proceeds  to  time  t  +  A,  and  according  to  the 
strategy  profile  (dx+A,  <72+A)  >  ^  will  receive  payoff  equal  to 

V[t  +  A]  =  e~~rAV  [t]  <V[t}. 

Therefore  this  deviation  is  not  profitable.  This  discussion  establishes  the  following  propo¬ 
sition  (proof  in  the  text). 

Proposition  23.  Suppose  that  Assumption  1  holds  and  that  A  <  A*  =  r-1  log  (/?  +  1  —  p). 
Then  there  exist  two  asymmetric  equilibria.  In  each,  one  firm,  i  =  1,2,  tries  its  project  with 
probability  1  immediately  and  the  other,  firm  ~  i,  never  tries  its  project  unless  it  observes 
the  outcomes  of  the  experimentation  of  firm  i.  Following  experimentation  by  i,  firm  ~  i 
copies  it  if  successful  and  experiments  with  its  own  project  otherwise. 

More  formally,  the  two  equilibria  involve  strategies  of  the  form: 


for  i  =  1,2. 


■)  =  (M) , 


1  (1,0 


(1»~  *) 


if  of  =  1  and  s *  =  1, 
if  of  =  1  and  s*  =  0, 


(0,  •)  ifaf  =  0, 


Next  we  study  symmetric  equilibria.  We  focus  on  the  case  where  the  time  interval  A  is 
strictly  positive  but  small. 

As  defined  above  a  firm’s  strategy  is  a  mapping  from  its  information  set  to  the  prob¬ 
ability  of  implementing  a  project.  We  refer  to  a  strategy  as  pure  if  the  experimentation 
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probability  at  a  given  time  t  is  either  0  or  1.  That  is,  a  pure  strategy  takes  the  form 


a1  :  V}  ->  {0,1}  x  {1,2}. 

Our  first  result  shows  that  for  small  A,  there  are  no  pure-strategy  symmetric  equilibria. 

Proposition  24.  Suppose  that  Assumption  1  holds  and  that  A  <  A*  =  log  ((3  +  1  —  p) 
(where  recall  that  (3  =  E^/Ifi}.  Then  there  exist  no  symmetric  pure-strategy  equilibria. 


Proof.  Suppose,  to  obtain  a  contradiction,  that  such  an  equilibrium  a*  exists.  The  first 
case  we  need  to  consider,  is  when  a*  involves  no  experimentation,  i.e.,  both  firms  wait  with 
probability  1  for  every  time  t.  Then,  it  is  straightforward  to  see,  that  firm  1  experimenting 
with  probability  1  at  time  t  =  0,  is  a  profitable  deviation.  This  implies  that  for  such  an 
equilibrium  o*  to  exist,  there  should  exist  some  time  t0  and  history  hto  (with  t0  =  kA  for 
k  G  N)  such  that  ato  ( (p  =  =  (1,  j).  Then  following  this  history  the  payoff  to  both 

firms  is 

V[t0\a*,<j*]=e-rt°pII1. 

Now  consider  a  deviation  by  firm  1  to  a',  which  involves,  after  history  hto,  waiting  until 
date  t0  +  A,  copying  firm  2’s  project  if  successful,  and  experimenting  with  its  own  project 
otherwise.  The  payoff  to  this  strategy  is 

v  [to  |  <x',  <T*]  =  e“rdo+A)  (pn2  +  (1  -  p)pU1) , 

since  firm  2  experiments  with  probability  1  at  time  t0  and  is  successful  with  probability  p. 
Clearly,  V  [f0  |  o7,  cr*]  >  V  [t0  \  cr*,  a*]  and  there  is  a  profitable  deviation  if 

e-r(t0+A)  (pn2  +  (i  -p)pU1)  >  e~rt°pHi, 
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A  <  A  *  =r  1  log  ((3  +  1  -  p)  ■ 


or  if 

Here  A*  >  0  since,  from  Assumption  1,  3  =  >  p.  This  establishes  the  existence  of 

a  profitable  deviation  and  proves  the  proposition.  ■ 

Proposition  24  also  implies  that  all  symmetric  equilibria  must  involve  mixed  strate¬ 
gies.  Moreover,  any  candidate  equilibrium  strategy  must  involve  copying  of  a  successful 
project  in  view  of  Assumption  1  and  immediate  experimentation  when  the  other  firm  has 
experimented.  Therefore,  we  can  restrict  attention  to  time  t  strategies  of  the  form 

(1,  i )  if  a4  =  1  and  s*  =  1, 

(1,  ~  i)  if  a4  =  1  and  s4  =  0,  (C-l) 

(q(t)A,i)  if  a4  =  0, 

for  t  =  0,  A,  2A, ....  where  q(t) A  is  the  probability  of  experimenting  at  time  t  conditional 
on  no  experimentation  by  either  firm  up  to  time  t  (all  such  histories  are  identical,  hence 
we  write  q(t)  instead  of  g(/i4)).  Clearly,  feasibility  requires  that  q(t)  A  <  1. 

Next  we  derive  an  explicit  characterization  of  the  unique  symmetric  equilibrium  as 
A  — »■  0.  In  the  text,  we  assume  that  firms  use  a  constant  probability  of  experimentation 
over  time,  i.e.,  q(t)  =  q  for  all  t  (Proposition  25  relaxes  this  assumption  and  establishes 
uniqueness  more  generally).  We  consider  a  symmetric  mixed-strategy  equilibrium  a*  and 
suppose  that  the  game  has  reached  time  t  without  experimentation.  Let  vw  [t  |  A]  and 
ve  [t  |  A]  denote  the  time  t  continuation  payoffs  to  firm  i  when  firm  ~  i  plays  a*  and 
firm  i  chooses  to  wait  or  to  experiment  (and  period  length  is  A).1  For  a  mixed-strategy 
equilibrium  to  exist,  we  need 

vw  [t  |  A]  =  ve  [t  |  A] .  (C.2) 

The  proof  of  Proposition  25  below  shows  that  all  symmetric  equilibria  involve  mixing  after 
1Here  we  use  v,  since  V  denotes  the  value  discounted  back  to  t  =  0. 
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any  history  h*  (with  no  experimentation  up  to  t),  i.e.,  equation  (C.2)  holds  for  all  such 
hf' ■  Therefore,  it  suffices  to  characterize  a*  such  that  (C.2)  holds.  First,  consider  firm  i' s 
payoffs  from  experimenting: 

t'e  [t  |  A]  =  qApUi  +  (1  -  gA)(p7rxA  +  e~rApU2),  (C.3) 

since  in  this  case  firm  i  is  successful  with  probability  p  and  receives  continuation  value  IT 
if  firm  ~  i  has  also  experimented  during  the  same  time  interval  (probability  qA),  and  it 
receives  7riA  +  e~rAn2  otherwise  (payoff  for  current  time  interval  plus  continuation  value). 

The  latter  event  occurs  with  probability  1  —  qA. 

Similarly,  its  payoff  from  waiting  is 

vw  [t  |  A]  =  e“rA  (qA  (pU2  +  (1  -  p)pU1)  +  (1  -  qA)vw  [t  +  A  |  A]  ) ,  (C.4) 

where  firm  i  receives  no  payoff  today  and  with  probability  qA.  firm  ~  i  experiments,  in 
which  case  firm  i  copies  if  the  experimentation  is  successful  and  experiments  with  its  own 
project  otherwise,  with  expected  continuation  return  pU2  +  (1  —  p)pUi.  With  probability 
1  -  qA,  firm  ~  i  does  not  experiment,  and  firm  i  then  receives  vw[t  +  A  \  A],  Adding  and 
subtracting  vw  [t  +  A  |  A]  from  the  left-hand  side  of  (C.4)  and  rearranging,  we  obtain 

vw  [t .  +  A  |  A]  (1  -  (1  -  qA)  e-rA)-(vw  [t  +  A  |  A]  —  vw  [t  |  A])  =  e~rAqA  (pU2  +  (1  -  p)  pU^  . 


Dividing  both  sides  by  A  and  taking  the  limit  as  A  -»  0  yields 


lim  <4,  [t  +  A  |  A] - —  lim  (S  [*  +  A  I  A1  W  [t  |  A] 

a->o  r  +  q  a->o  y  A 


r  +  q 


\pU2  +  (1  -p)  pill]  • 

(C.5) 


From  equation  (C.3),  we  see  that  ve[t  |  A]  does  not  depend  on  t.  Since  equation 
(C.2)  holds  for  all  hl  (thus  for  all  t),  we  have  vw  [t  |  A]  =  ve  [t  |  A]  and  vw  [i  +  A  |  A]  = 
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ve  [t  +  A  I  A],  implying  that  vw  [t  +  A  |  A]  =  vw  [t  \  A],  Therefore,  the  second  term  on  the 
left-hand  side  of  (C.5)  must  be  equal  to  zero.  Moreover,  taking  the  limit  as  A  — >  0  in  (C.3), 
we  obtain 

lim  vw  \t  +  A  I  A]  =  lim  ve  [t  I  A]  =  pn2. 

A— >0  A-vO 

Combined  with  (C.5),  this  yields 

q(t )  =  q*  =  ^  for  all  t,  (C.6) 


where  recall,  from  (2.1),  that  /3  =  n2/IIi. 

The  next  proposition  relaxes  the  assumption  that  q(t)  is  constant  for  all  t  and  shows 
that  this  is  indeed  the  unique  symmetric  equilibrium. 

Proposition  25.  Suppose  that  Assumption  1  holds  and  A  — >  0.  Then  there  exists  a  unique 
symmetric  equilibrium.  In  this  equilibrium,  both  firms  use  the  mixed  strategy  6  as  given  in 
(2.2)  with  q(t)  —  q*  as  in  (C.6). 

Proof.  We  first  show  that  any  symmetric  equilibrium  must  involve  mixing  after  any  his¬ 
tory  hl  e  TO'  along  which  there  has  been  no  experimentation.  The  argument  in  the  proof  of 
Proposition  24  establishes  that  after  any  such  history  fib  there  cannot  be  experimentation 
with  probability  1.  We  next  show  that  there  is  positive  probability  of  experimentation  at 
time  t  =  0.  First  note  that  the  equilibrium-path  value  to  a  firm,  V*,  (discounted  back 
to  time  t  —  0),  satisfies  V*  >  pU2,  since  each  firm  can  guarantee  this  by  experimenting 
at  time  t  =  0.  This  implies  that  in  any  equilibrium  there  must  exist  some  time  T  such 
that  after  time  T  there  is  positive  probability  of  experimentation  and  innovation.  Now 
to  obtain  a  contradiction,  suppose  that  T  >  0.  By  the  argument  preceding  the  propo¬ 
sition,  limA->o  K.  [T  |  A]  =  e"rTpll2,  and  therefore,  in  any  mixed-strategy  equilibrium, 
V*  [T  |  A]  — >  e_rrpn2.  However,  for  T  >  0  this  is  strictly  less  than  V*  >  pU2,  yield¬ 
ing  a  contradiction  and  establishing  the  desired  result.  The  same  argument  also  establishes 
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that  there  cannot  exist  any  time  interval  (T,T'),  with  T  >  T,  along  which  there  is  no 
mixing. 


Hence,  along  any  history  h*  where  there  has  not  been  an  experimentation,  both  firms 
must  be  indifferent  between  waiting  and  experimenting.  This  implies  that  (C.2)  must  hold 
for  all  t.  Let  q(t)  A  denote  the  probability  of  experimentation  at  time  t.  Firm  z’s  payoff  for 
experimenting  at  time  t  is  given  by  an  expression  similar  to  equation  (C.3), 

ve  [t  |  A]  =  q{t)ApIh  +  (1  -  q(t)A)pU2.  (C.7) 


We  next  show  that  the  probability  of  experimentation  q(t)  in  a  symmetric  equilibrium 
is  a  continuous  function  of  t.  Suppose  that  q(t)  is  not  continuous  at  some  i  >  0.  If 
q(t)  <  q(t+ )  (where  q(t+ )  =  limtyg  (t)).  it  then  follows  from  (C.7)  and  Assumption  1  that 
ve  [t  |  A]  <  ve  [t+  |  A] .  This  implies  that  firm  i  has  an  incentive  to  delay  experimentation  at 
time  t.  But  this  contradicts  the  fact  that  the  symmetric  equilibrium  must  involve  mixing 
at  all  such  t.  Similarly,  if  q(t)  >  g(f+),  we  have  ve[t\  A]  >  ve  [t+  |  A],  implying  that 
firm  i  will  experiment  with  probability  1  at  time  /”  again  yielding  a  contradiction.  This 
establishes  that  q(t )  is  a  continuous  function  of  t. 


A  derivation  similar  to  that  preceding  the  proposition  then  shows  that  equation  (C.5) 
holds  when  q  is  replaced  by  q(t).  In  particular, 


lim  vw  It  +  A  I  A1 - —  lim 

A— >o  r  +  q(t)  A— >o 


'vw  [t  +  A  |  A]  -  vw  [t  |  A]’ 


q(t) 


r  +  q(t ) 


\pU2  +  (1  ~p)pUi] 

(C.8) 


Since  vw  [t  |  A]  =  ve  [t  |  A]  and  vw  [t  +  A  |  A]  =  ve  [t  +  A  |  A],  we  can  write  the  second 
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term  on  the  left-hand  side  of  equation  (C.8)  as 


lim 

A— >-0 


^vw  [t  +  A  |  A]  -  vw  [t  |  A]  ^ 


lim 

A-s-0 


lim 

A -40 


^■C[t  +  A|  A]-t.,[f|A]j 
('](*  +  A)  -  g(i))  Ap(II]  -  n2) 


where  the  second  equality  follows  from  equation  (C.7)  and  the  third  equality  holds  by  the 
continuity  of  the  experimentation  probability  q(t).  Substituting  for 


lim  vw  [t  +  A  I  A]  =  lim  ve  [t  |  A]  =  pU2, 

A->0  A-40 

in  equation  (C.8)  and  solving  for  q(t)  yields  q(t)  =  q*  as  in  (C.6),  completing  the  proof.  ■ 
Note  that  the  limit  when  A  0,  which  Proposition  25  shows  is  well  behaved,  also 
corresponds  to  the  (symmetric)  equilibrium  of  the  same  model  set  up  directly  in  continuous 
time,  thus  showing  formally  the  equivalence  of  the  two. 
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Appendix  D 


Omitted  Proofs  from  Chapter  6 


Algorithm  for  observation  radius 

The  following  algorithm  computes  the  perfect  observation  radius  for  agent  i  and  com¬ 
munication  network  topology  Gn.  Specifically,  the  algorithm  computes  the  sequence  of 
optimal  decisions  of  agent  i  for  any  realization  of  the  private  signals  under  the  assumption 
that  all  agents  except  %  never  exit  and  keep  transmitting  new  information.  Assume  for  sim¬ 
plicity  that  Gn  is  connected  (otherwise  apply  the  algorithm  to  the  connected  component  in 
which  agent  i  resides).  Let  t€mlj  denote  the  maximum  distance  between  i  and  an  agent  in 
Gn.  Note  that  this  implies  B?  =  Nn.  The  state  at  communication  step  t,0<t<  tendi, 
is  simply  the  number  of  private  signals  the  agent  has  observed  thus  far  and  the  empirical 
mean  for  9,  kt  and  9t  respectively.  The  algorithm  computes  the  optimal  decision  for  agent 
i  starting  from  the  last  possible  communication  step  tendA  and  working  its  way  through  the 
beginning  (t  =  0).  In  particular  (we  drop  subscript  i), 


Algorithm  1. 

(V 


OPTdec(tend,  ktend,  9tend )  =  6tend. 


and  Pay (tenii ,  ktend ,  ^tend)  —  tt  ~ 

-For  t  =  fend  —  1  to  0  do: 

(i)  Preliminaries: 

*  Number  of  new  observations  at  step  t:  £  =  |Bft+1|  — 

*  Payexit{t,kt,6t)  =  7T-  l/p2+\rp2kt- 

*  Paywai\t ,  kt,  9tj  ■=  Pay(i  +  1,  A*  +  £,  0't). 

(ii)  If  Paywait  <  (1  —  r / X)Payexit  then 

OPTdec{t ,  kt,  6t)  =  9t  and  Pay(t ,  kt,  6t)  =  tt  -  -■ ■■■  1 

1/ pz  +  1/ p^k? 

Else  OPTdec(t ,  A;t,  0t)  —  “wait”  and  Pay(t,  kt,  9t)  =  Paywait(t,  kt,  9t). 


Correctness:  Note  that  if  agent  i  has  not  exited  till  the  last  time  period,  then,  since 
there  is  no  more  observations  to  be  made  in  the  future,  she  will  exit  and  choose  the  action 
that  maximizes  her  expected  payoff  (see  Lemma  4).  For  any  other  time  period  t  agent  i’s 
expected  payoff  of  exiting  taking  an  action  at  the  current  time  period  is  given  by  Payexit. 
Finally,  the  agent  decides  to  “wait”  only  if  this  action  leads  to  higher  expected  payoff.  It 
is  straightforward  to  see  that  the  computational  complexity  of  the  algorithm  is  0(n2).  To 
relate  the  algorithm  with  computing  the  perfect  observation  radius,  note  that  the  perfect 
observation  radius  coincides  with  the  exit  time  computed  by  the  algorithm. 

Finally,  Lemma  8  states  that  the  probability  of  choosing  an  action,  that  is  more  than 
e  away  from  the  optimal,  for  agent  i,  such  that  i  6  :  i.e.,  P*  (M”’£  =  0)  is  uniformly 

bounded  away  from  0. 


Lemma  8.  Let  k  >  0  be  a  constant,  such  that  the  k-radius  set  is  non-empty.  Then, 


P a{M?’e  =  0)  >  erf 


-  erf 


for  all  i  e  Vffa, 
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where  erf(x )  =  ^  fo  e  t2  dt  ^  the  error  function. 


Proof.  The  lemma  follows  from  the  observation  that  because  of  our  normality  assumption 
the  empirical  mean  9  after  observing  £  private  signals  is  normally  distributed  around  9  with 
variance  p?  =  p2 ji.  Then,  the  probability  that  Mf'c  —  0  is  simply  equal  to  the  probability 
that  a  random  variable  X  ~  N( 0,  p2 /£)  does  not  belong  to  interval  [— e,  e],  i.e., 

The  lemma  follows  since  agent  i  <E  V£’a ,  thus  she  takes  an  irreversible  action  after  observing 
at  most  k  private  signals.  ■ 

Proof  of  Proposition  15.  First,  we  show  that  learning  fails  if  condition  (6.3)  holds,  i.e., 
there  exists  an  77  >  0,  such  that 


limsup—  ■  |VfT,<T|  >  rj.  (D.l) 

n—toc  72 

and  k  <  k  given  by  Condition  (6.3).  From  condition  (D.l)  we  obtain  that  there  exists  an 
infinite  index  set  J  such  that 


|  V^}  |  >  rj  ■  Uj  for  j  G  J. 

Now  restrict  attention  to  index  set  J,  i.e.,  consider  n  =  Uj  for  some  j  6  J .  Then, 


P„ 


„  a  E”_1  MP  >1-0  =  n>„  (i  [e,€v».»  mp  +  E <tvr  Kf\  >  1  -  £) 


<p»(1 


E,evr- Kf  +  n~  \vp 


>  1  -  € 


=  Dev-  Mr  > 


vr 


—  e 


where  the  inequality  follows  since  we  let  Mf'f  =  1  for  all  i  Vf"a .  Next  we  use  Markov’s 
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inequality  to  obtain 


-  y  M'i,e  > 

n  z-'  1 


ievr 


vr 

J 

n 

E„ 


< 


[Ei 


£Vn,<r 


Ml 


n,e 


n-  ( \vk'a\/n~e )  ' 


We  can  view  each  summand  above  as  an  independent  Bernoulli  variable  with  success  prob¬ 


ability  bounded  above  by  erf 


\Z'2P2/k 


erf  - 


y/2p2 /k 


from  Lemma  8.  Thus, 


k 

/  v  (er/(v#>) 

1  -er/( 

v  W/i2/^)) 

n* 

( yr  /—) 

—  .  / 

1  T  rn  .O 

n\vk 

\ 

/n— e 

j 

) 

5  A  (W  (vto)  - er/  (-yfe))  < 1  -  s> 


where  the  second  inequality  follows  from  the  fact  that  n  was  chosen  such  that  |  Vf  'a  |  >77-71. 
Finally,  the  last  expression  follows  from  k  <  k  (cf.  Condition  (6.3)).  We  obtain  that  for  all 
j  (z  J  it  holds  that 


PCT 


J  i=l 


>  e)  >5. 


Since  J  is  an  infinite  index  set  we  conclude  that 


lim  sup 

n— >•  00 


1  n 

1=1 


Mr) 


>  e  >5, 


thus  e,  6-asymptotic  learning  is  incomplete  when  (6.3)  holds. 

Next,  we  prove  that  Condition  (6.2)  is  sufficient  for  e,  (5-asymptotic  learning.  As  mentioned 
above,  if  agent  i  takes  an  irreversible  action  after  observing  £  signals,  then  the  probability 
that  M”’e  =  1  is  equal  to  the  probability  that  a  random  variable  X  ~  N(0,p2/£)  takes  a 
value  in  interval  [— e,e].  Thus, 


P  .(Mr  =  1)  =  erf 


-  erf 


(D-2) 
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Similarly  with  above,  we  have 


P<X 


n 


£(1  - 


i= 1 


>ei  <  p„I  l-Va -vy; 

n 

<  E,  [£,*y(l  -  VD] 

n  (e  —  V  /n) 


>  e  - 


M 


n 


(D.3) 


where  V  =  ji  |  |Brer„,<r  |  <  A:  j  and  the  second  inequality  follows  from  Markov’s  inequality. 
By  combining  Eqs.  (D.2)  and  (D.3)  and  letting  k'fu  denote  the  number  of  private  signals 
that  agent  i  observed  before  taking  an  action, 


E„ 


E*y(l  “  M?) 


J2i^v  1  ~[erf 


n(e-  \  V\/n) 


< 


y/WFF , 


erf 


FW/k7' 


n  (e  —  | V\jn) 


(D.4) 


We  have 


erf 


#7? 


erf 


>  1  - 


-  0 

1-C  : 


(D.5) 


y/W/W, 

for  all  i  f  V  from  the  definition  of  k  (cf.  Condition  (6.2)).  Thus,  combining  Eqs.  (D.3), (D.4) 
and  (D.5),  we  obtain 


1  n 

1 53  a  -  Mn 

i—  1 


>  e  )  <  S  for  all  n  >  iV, 


where  N  is  a  sufficiently  large  constant,  which  implies  that  condition  (6.2)  is  sufficient  for 
asymptotic  learning.  ■ 

Proof  of  Proposition  16.  We  show  that  although  an  agent  has  potentially  access  to 
less  information  under  Assumption  6,  perfect  asymptotic  learning  occurs  whenever  perfect 
asymptotic  learning  occurs  under  Assumption  5. 

Proposition  26.  If  perfect  asymptotic  learning  occurs  in  society  {G™}^  under  Assump¬ 
tion  5,  then  perfect  asymptotic  learning  occurs  under  Assumption  6  along  any  equilibrium 
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Agent  V s  observation  set  B] 


i,d? 


Agent  j’s  observation  set  B 


Node  i 


Node  j 


-dr 


-d]- 


Node  £ 


-dist{£,  i)  <  d'l 


-dist(£,  j)l- 


Figure  D-l:  Proof  of  Proposition  26. 


a. 


Proof.  Consider  set  U%,  where  recall  that  UJ?  is  the  set  of  agents  that  are  at  a  short 
distance  (at  most  equal  to  their  observation  radius)  from  an  agent  with  in-degree  at  least 
k.  Define  similarly  set  Z]':  rT  as  the  set  of  agents,  that  at  some  equilibrium  a,  communicate 
with  an  agent  with  in-degree  at  least  k.  Note  that  under  Assumption  5  the  sets  are  equal, 
i.e.,  UJ!  =  ZJla .  To  complete  the  proof,  we  show  that  for  k  large  enough  (and  consequently 
n  large  enough),  U[!  =  Z£a ,  even  under  Assumption  6. 

Consider  i  €  and  let  V"  =  {£,  i\,  ■  ■  ■  denote  the  shortest  path  in  communi¬ 

cation  network  Gn  between  i  and  any  agent  £,  with  defj”  >  k.  First  we  show  the  following 
(refer  to  Figure  D-l) 

i  e  u;‘  =>  j  e  UJ!  for  all  j  e  Vn.  (D.6) 

Assume  for  the  sake  of  contradiction  that  condition  (D.6)  does  not  hold  and  consider  the 
simplified  environment  under  Assumption  5  (we  are  only  looking  at  the  set  £/£,  so  we  can 
restrict  attention  to  that  environment).  Then,  let 

j  =  argmin {distn(£,j')\j'  £  Vn  and  distn(£,j’)  >  r",}. 

j/  j 

For  agents  i,  j  we  have  r J*  >  and  dist(j,i )  +  d]  <  dist(£,i )  <  rf,  since  otherwise 
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j  eU£.  This  implies  that  Ji'T.  C  B™Tn.  Furthermore, 


1/p2  +  l/p2(l  + 


on 


I) 


>  E J-e 


rt 


1/ P2  +  l/p2(l  +  \B™rn\  +  k) 


),  (D.7) 


where  t  is  the  time  for  dist(£ ,  j)  —  r"  extra  communication  steps  to  take  place.  In  particular, 
the  left  hand  side  is  equal  to  the  expected  payoff  of  agent  j  if  she  takes  an  irreversible  action 
at  time  r"  after  receiving  IT/r„  |  observations,  whereas  the  right  hand  side  is  a  lower  bound 
on  the  expected  payoff  if  agent  j  delays  taking  an  action  until  after  she  communicates  with 
agent  £.  The  inequality  follows,  from  the  definition  of  the  observation  radius  for  agent  j. 
On  the  other  hand, 

,  for  some  e  >  0, 
(D.8) 

For  k  large  enough  we  conclude  that  dist(£,j)  <  dist(£,i )  —  dist(j,i),  which  is  obviously  a 
contradiction.  This  implies  that  (D.6)  holds. 

Next  we  show,  by  induction  on  the  distance  from  agent  £  with  degree  >  k.  that  Ujk  = 
Zr;:a  for  any  equilibrium  a.  The  claim  is  obviously  true  for  all  agents  with  distance  equal  to 
0  (agent  £')  and  1  (her  neighbors).  Assume  that  the  claim  holds  for  all  agents  with  distance 
at  most  t  from  agent  £,  i.e.,  if  i  G  U[!  and  dist(£,  i)  <  t  then  i  e  Z%a.  Finally,  we  show  the 
claim  for  an  agent  i  such  that  %  G  U£  and  dist(£ ,  i)  =  t  +  1.  Consider  a  shortest  path  Vn 
from  i  to  £.  Condition  (D.6)  implies  that  all  agents  j  in  the  shortest  path  are  such  that 
j  G  Uj£,  thus  from  the  induction  hypothesis  we  obtain  j  G  Z%>(r.  Thus,  for  k  sufficiently 
large  we  obtain  that  i  G  Z^ff,  for  any  equilibrium  a. 

Finally,  from  Corollary  2  we  conclude  that  asymptotic  learning  under  Assumption  5 
implies  asymptotic  learning  under  Assumption  6.  ■ 

The  first  part  of  Proposition  16  follows  directly  from  Proposition  15  and  Proposition  26. 
To  conclude  the  proof  of  Proposition  16  we  need  to  show  that  if  asymptotic  learning  occurs 


l/P2  +  l/p2(l+!^Tn|) 

j  ’  7 


<  E0 


— e 


rt' 


l/p2  +  l/p2(l  +  \BlrA  +  k) 
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when  condition  (6.2)  does  not  hold  along  some  equilibrium  cr,  then  the  society  contains  a 
set  of  leaders.  In  particular,  consider  a  society  {6'"};^1  in  which  condition  (6.2)  does  not 
hold,  i.e.,  assume  as  in  the  proof  of  Proposition  15  that  there  exists  a  k  >  0,  e  >  0  and 
infinite  index  set  J,  such  that  >  e  •  rij  for  j  £  J.  We  restrict  attention  to  index  set 
J  and  consider  a  =  {an}!^Ll  an  equilibrium  along  which  asymptotic  learning  occurs  in  the 
society. 

Consider  a  collection  of  subsets  of  the  possible  realizations  of  the  private  signals, 
{Qn}n= n  anfl  a  collection  of  subsets  of  agents,  {Rn}%Li,  such  that: 

(i)  limn_^oo  Pa(Qn)  >  e,  for  some  e  >  0,  i.e.,  the  subsets  of  realizations  considered  have 
positive  measure  as  the  society  grows,  and  if  a  realization  is  in  Qn  its  complement  is 
also  in  Qn. 

(ii)  lim^oo  ±\Rn\  >  e. 

Such  collections  should  exist,  since  condition  (6.2)  fails  to  hold  in  the  society.  Consider 
next  equilibrium  a  and  assume  that  the  realization  of  the  private  signals  belongs  to  subset 
Qn  (for  the  appropriate  n) .  Since  asymptotic  learning  occurs  along  equilibrium  a  we  have: 


lim  —\FQ\  =  0, 

n— >oo  Tl 


where  —  {i  £  Rn\a™dn  £  {0,1}}.  However,  this  implies  that  there  should  exist  a 
collection  of  subsets  of  agents,  {.5m}(}L1,  such  that: 

(i)  Rna'c  C  S]ollow,  where  Rn/  =  {i  £  Rn\ald,,  =  “ wait ”}. 

(ii)  a”r  ^  {“wait"},  for  some  r  <  r”  -  dist(i,j )  and  i  such  that  j  £  B?rn,  i  £  Rna. 

(iii)  lim^oo  ^|5'n(  =  0,  since  otherwise  asymptotic  learning  would  not  occur  along  equi¬ 
librium  a. 

Note  that  collection  {S'n}^=1  satisfies  the  definition  of  a  set  of  leaders  [cf.  Definition  5], 
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since 


lim  sup  — - |i?n,c|  >  e 

n— >oo  ^ 

and  Proposition  16  (ii)  follows.  ■ 

Proof  of  Theorem  1. 

The  proof  of  the  theorem  relies  heavily  on  the  next  proposition,  which  intuitively  states 
that  there  is  no  incentive  to  lie  to  an  agent  with  a  large  number  of  neighbors,  assuming 
that  everybody  else  is  truthful. 

Proposition  27  (Truthful  Communication  to  a  High  Degree  Agent).  There  exists  a  scalar 
k  >  0,  such  that  truth-telling  to  agent  i,  with  indegf  >  k(e),  in  the  first  time  period  is  an 
equilibrium  of  IN FO{Gn).  Formally, 

(an’truth,mn'truth)  e  INFOiG11),. 


where  m^ffuth  =  Sj  for  j  6  . 

Proof.  The  proof  is  based  on  the  following  argument.  Suppose  that  all  agents  in  Br] 
except  j  report  their  signals  truthfully  to  i.  Moreover,  let  \B'fl  |  >  k.  where  A:  is  a  large 
constant.  Then,  it  is  an  weakly  dominant  strategy  for  j  to  report  her  signal  truthfully  to  i, 
since  f  s  message  will  not  be  pivotal  for  agent  i,  i.e.,  i  will  take  an  irreversible  action  after 
the  first  communication  step,  no  matter  what  j  reports.  ■ 

Theorem  1  follows  directly  from  Proposition  27  and  Proposition  16.  ■ 

Proof  of  Theorem  2.  The  proof  uses  similar  arguments  with  the  ones  presented  in  the 
proof  of  Proposition  26.  A  key  observation  is  that  a  social  planner  cannot  achieve  a  higher 
aggregate  welfare  than  that  achieved  by  the  agents  under  Assumption  5.  ■ 

Proof  of  Proposition  17.  The  claim  follows  by  noting  that  the  social  planner  could 
choose  the  following  strategy  profile:  for  each  j  €E  Dp]  delay  i’s  irreversible  action  by  at 
least  one  time  period,  where  i  is  an  agent  such  that  if  i  delays  then  j  gains  access  to  a  least 
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£  additional  signals.  Moreover,  it  is  straightforward  to  see  that  there  exist  e,  5  for  which 
e,  (5-learning  fails.  ■ 
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Appendix  E 


Omitted  Proofs  from  Chapter  7 


Proof  of  Proposition  18 

(i)  Let  the  discount  rate  be  r  >  0.  Then,  the  perfect  observation  radius  of  an  agent 
is  bounded  by  a  constant  Z  and,  thus,  \V^\  =  n  for  all  n  >  kz .  Thus,  perfect  asymptotic 
learning  fails  from  Proposition  16. 

(ii)  First,  we  state  a  lemma  from  [16]  for  bounding  the  probability  that  there  exists  a 
path  between  two  agents  in  a  preferential  attachment  network.  For  the  remainder,  of  the 
proof,  agent  i  is  simply  the  agent  that  was  born  at  time  i. 

Lemma  in  [16]  Let  {G™}^  be  a  preferential  attachment  sequence  of  communication 
networks  and  consider  network  Gn,  for  n  large  enough.  Let  P  be  a  path  from  agent  i  to 
agent  j  in  Gn,  where  Sn{P)  denotes  the  set  of  edges  of  path  P.  Then, 

wip  c  g”)  <  c|£"(p)l  TT  -1=, 

where  C  is  an  absolute  constant. 

This  lemma  allows  us  to  consider  a  random  communication  network,  where  an  edge  between 
agents  i  and  j  is  present  independently  with  probability  1/s/i],  although  in  the  original 
network  Gn  edges  are  dependent.  The  intuition  why  such  a  result  is  true  comes  from 
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observing  that  the  dependencies  between  edges  due  to  the  preferential  attachment  rule  are 
not  too  strong  as  the  communication  network  grows. 

We  are  now  ready  to  proceed  with  the  proof.  Let  wn  be  the  average  degree  for  Gn ,  which 
by  the  preferential  attachment  rule  remains  finite  with  high  probability  as  n  grow's,  i.e., 
lim^^  wn  <  W.  In  particular,  for  sufficiently  large  n, 


P(at  most  n/c  nodes  have  degree  more  than  cW)  >  1  —  e/2,  for  all  c  >  0.  (E.l) 


Next,  we  use  the  above  lemma  to  show  that  most  low  degree  agents  agents  are  not  connected 
with  a  short  path  to  high  degree  agents.  In  particular,  let  S}  (S2)  denote  the  set  of  agents 
that  have  degree  greater  (smaller)  than  cW.  We  want  to  bound  the  probability  that  there 
exists  a  short  path  (at  most  L  links  long)  from  an  agent  in  S2  to  an  agent  in  6) .  As 
mentioned  above,  with  high  probability  |,S’i|  <  n/c.  Then,  if  P[  denotes  a  path  of  length  £ 
from  agent  k  6  S2  to  any  agent  in  Si,  we  have 


End  e  g”)  <  £  £ 


ct 


E  IIv 


i=i 


t=\  jeSi  ‘  3  Ul)...;U<_lGs2 1= l  Ut 


£=i  jes,  J  ies2  1 


^E  E 


ce 


E  7 


(—1  j€{l,—  ,n/c}  ^  ^  i£{n/c,---  ,n} 


<  C  <  1)  for  agents  with  k  =  O(n), 


(E.2) 


(E.3) 

(E.4) 


where  Eq.  (E.2)  follows  from  Lemma  in  [16]  and  Eq.  (E.3)  follows  by  setting  set  Si  to  be 
the  set  of  the  first  n/c  nodes  (lowest  indices)  and  set  S2  the  rest.  Finally,  Eq.  (E.4)  follows 
for  an  appropriate  choice  of  constant  c  (and  consequently  the  size  of  set  Si).  In  particular, 
as  n  grows,  we  obtain  £*£{«/*,..,«}  f  ~  logo  and  £je{1  /c}  ^  for  k  =  0{n). 


Finally,  note  that  with  high  probability  most  of  the  agents  with  index  0(n)  belong  to  set 
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S2,  since  by  Lemma  in  [16]  above  their  expected  degree  is  bounded  by  a  small  constant. 
Combining  Eqs.  (E.l),  (E.4)  and  Corollary  2  we  conclude  that  perfect  asymptotic  learning 
fails  in  a  preferential  attachment  communication  network  sequence  with  probability  at  least 
1  —  e  for  discount  rate  r  >  0. 

(iii)  Follows  directly  from  (ii),  since  Erdos-Renyi  communication  networks  are  even 
sparser  than  preferential  attachment  communication  networks. 


Proof  of  Proposition  19 

(i)  -  (ii)  The  claims  follow  directly  from  Proposition  16.  In  particular,  for  r  <  fx  (for 
complete)  and  r  <  f2  (for  star),  where  fx,f2  such  that: 

%  ~  1/p2  +  1/p2  <  ^A  +  fh  and  ^  "  1/P2  +  1/P2  <  77  (x+fa) 
we  obtain  that 

lim  lim  —  ■  \  V£\  =0, 

k— >oc  n— too  Tl 

therefore  perfect  asymptotic  learning  occurs  in  the  respective  societies. 

(iii)  Follows  directly  from  [70]. 

(iv)  Consider  the  following  two  events  A  and  B. 

Event  A:  Layer  1  (the  top  layer)  has  more  than  k  agents,  where  k  >  0  is  a  scalar. 

Event  B:  The  total  number  of  layers  is  more  than  k. 

From  the  definition  of  a  hierarchical  sequence  of  communication  networks,  we  have 

w=n(i-i^)<»p(-E^)-  (E-5) 

Also, 

(B.6) 

i—2 
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from  Markov’s  inequality,  where  £  is  a  random  variable,  that  denotes  the  number  of  layers 
in  the  hierarchical  society.  Let  £(e)  be  small  enough  and  k  (and  consequently  n)  large 
enough  such  that  Y!L2  prc  >  log  7  and  prc  <  X-  For  those  values  of  C  and  k  we 
obtain  P(A)  <  e/4  and  P(Z?)  <  e/4.  Next,  consider  the  event  C  =  Ac  D  Bc,  which  from 
Eqs.  (E.5)  and  (E.6)  has  probability  P(C)  >  1  -  e/2  for  the  values  of  (  and  k  chosen  above. 
Moreover,  we  consider 

Event  D:  The  agents  on  the  top  layer  are  information  hubs,  i.e. , 


lim  | |  =  00,  for  all  i  £  A/?. 

n— s-o o  ’ 

We  claim  that  event  D  occurs  with  high  probability  if  event  C  occurs,  i.e.,  P(Z)|  C)  > 
1  —  e/2,  which  implies 


P(C  n  D)  =  P(D|  C)P(C)  >  (1  -  e/2)2  >  1  -  e. 


(E.7) 


In  particular,  note  that  conditional  on  event  C  occurring,  the  total  number  of  layers  and  the 
total  number  of  agents  in  the  top  layer  is  at  most  k.  From  the  definition  of  a  hierarchical 
society,  agents  in  layers  with  index  i  >  1  have  an  edge  to  a  uniform  agent  that  belongs  to 
a  layer  with  lower  index,  with  probability  one.  Therefore,  if  we  denote  the  degree  of  an 
agent  in  a  top  layer  (say  agent  1)  by  we  have 


PJ* 


/7"'n 
' 2 

Eq-level2 


i= 1 


TP 


+ 


+  E2S 


level  C 


i—  1 


(E.8) 


where  7)n  denotes  the  random  number  of  agents  in  layer  i  and  is  an  indicator  variable 
that  takes  value  one  if  there  is  an  edge  from  agent  i  to  agent  1  (here  level  j  simply  denotes 


that  agent  i  belongs  to  level  j).  Again  from  the  definition,  we  have  p(/^l’eb  =  1)  = 


il  r'j-l  rn  ) 

^l-i  ' i 


where  the  sum  in  the  denominator  is  simply  the  total  number  of  agents  that  lie  in  layers 
with  lower  index,  and  finally,  7(n  +  •  ■  ■  T£  —  n. 


142 


We  can  obtain  a  lower  bound  on  the  expected  degree  of  an  agent  in  the  top  layer 
conditional  on  event  C  by  viewing  (E.8)  as  the  following  optimization  problem: 


where  we  make  use  of  the  fact  that  the  total  number  of  layers  is  bounded  by  k,  since  we 
condition  on  event  C.  By  solving  the  problem  we  obtain  that  the  objective  function  is 
lower  bounded  by  (j>{n ),  where  4>(n)  =  0(n1//fc)  for  every  n.  Then, 


E[D^\C\  = 

k 

=  E  E  P(T  =  £,7r  =  ,7?  =  kt\C)-E[V?\C,£  =  e,7?  =  ku<--  ,7 7  =  ke] 

i= 2  k\  <&,-•• 
ki+-“+k{>=n 

k 

>E  E  P(T  =  £,77  =  £;!,■■•  ,l?  =  ke\C)-<j>(n)  =  (f>(n),  (E.9) 

l=-'2  k\<k,-- ,k( 
fclH - h  k(=n 

where  Eq.  (E.9)  follows  since  E[Vf\C,C  =  £,7^n  =  fci,  •  •  •  ,Ten  =  kt\  >  <f>(n)  for  all  values 

of  £  (2  <  £  <  k)  and  h,  -  ■■  ,k£  (ki  <  k,  kx  H - 1 -  ke  =  n)  from  the  optimal  solution  of 

the  optimization  problem.  The  same  lower  bound  applies  for  all  agents  in  the  top  layer. 
Similarly  we  have  for  the  variance  of  the  degree  of  an  agent  in  the  top  layer  (we  use 
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£,  ki,  ■  ■  -  ,  k(_  as  a  shorthand  for  C  =  A,  T™  =  Aq,  •  •  ■  =  Aq) 


k 

Var[V^\C\  =  J2  E  P^’  M W  •  ^ar^jC,  £,  Aq,  •  •  •  ,  A:,] 

^=2  ki<k,---  ,k(i 

fciH - ffc£=n 

k 

=  E  E  P(f’  • '  •  -  •  (^Var(/'7e'2)  +  ■  ■  ■  +  Aqfoar(/“))  (E.10) 

<?=1  fcl  <<!,■■• 

fclH - hfcf=,re 

k 

<  E  E  P^>  fei«  ■  •  •  > **!<?)  •  (^E(/'Td2)  +  •  ■  -  +  A*E(J?r"))  =  E[Z?|C], 

£=1  ,ke 

k\-\ - f -ki=n 

(E.ll) 


where  Eq.  (E.10)  follows  by  noting  that  conditional  on  event  C  and  the  number  of  layers 
and  the  agents  in  each  layer  being  fixed,  the  indicator  variables  (defined  above)  are  inde¬ 
pendent  and  Eq.  (E.ll)  follows  since  the  variance  of  an  indicator  variable  is  smaller  that 
its  expectation.  We  conclude  that  the  variance  of  the  degree  is  smaller  than  the  expected 
value  and  from  Chebyschev’s  inequality  we  conclude  that 


p(D)-p(n  ^T>o>i 


e/2, 


where  C  >  0,  i.e. ,  with  high  probability  all  agents  in  the  top  layer  are  information  hubs 
(recall  that  lim^oo  4>{n)  =  oo). 

We  have  shown  that  when  event  C  D  D  occurs,  there  is  a  path  of  length  at  most  k  (the 
total  number  of  layers)  from  each  agent  to  an  agent  at  the  top  layer,  i.e.,  an  information 
hub  with  high  probability.  Therefore,  if  the  discount  rate  r  is  greater  than  some  lower 
bound  (r  >  f4),  then  perfect  asymptotic  learning  occurs.  Finally,  we  complete  the  proof 
by  noting  that  F(C  fl  D)  >  (1  —  e/2)2  >  1  —  e. 
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Appendix  F 


Omitted  Proofs  from  Chapter  8 

Proof  of  Theorem  3 

First  we  make  an  observation  which  will  be  used  frequently  in  the  subsequent  analysis. 
Consider  an  agent  i  such  that  H"r[i)  G  7C~,  where  k  is  an  integer  appropriately  chosen  (see 
below),  i.e,.  the  size  of  the  social  clique  of  agent  i  is  greater  than  or  equal  to  fc,  \H™C^\  >  k. 
Suppose  agent  i  does  not  form  a  link  with  cost  c  with  any  agents  outside  her  social  clique. 
If  she  makes  a  decision  at  time  t  =  0  based  on  her  signal  only,  her  expected  payoff  will 
be  it  —  ljp2+i/p2  ■  If  she  waits  for  one  period,  she  has  access  to  signals  of  all  the  agents 
in  her  social  clique  (i.e.,  she  has  access  to  at  least  k  signals),  implying  that  her  expected 
payoff  would  be  bounded  from  below  by  ^  ^  .  Hence,  her  expected  payoff 

Ep^g”)]  satisfies 

EWs")]  >  max  {*  -  v/+w.  TTa  “  1/^  +  1  /fk)  }  ’ 

for  any  link  formation  strategy  gn  and  along  any  a  G  INFO(Gn)  (where  Gn  is  the  com¬ 
munication  network  induced  by  gn). 

Suppose  now  that  agent  i  forms  a  link  with  cost  c  with  an  agent  outside  her  social 
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clique.  Then,  her  expected  payoff  will  be  bounded  from  above  by 


E[IIj(gn)]  <  max  <  ix  — 


A 


1/p2  +  1/p2’  \A  +  r 


7T  —  C 


where  the  second  term  in  the  maximum  is  an  upper  bound  on  the  payoff  she  could  get  by 
having  access  to  signals  of  all  agents  she  is  connected  to  in  two  time  steps  (i.e.,  signals  of 
the  agents  in  her  social  clique  and  in  the  social  clique  that  she  is  connected  to).  Combining 
the  preceding  two  relations,  we  see  that  an  agent  i  with  e  Wf.  will  not  form  any 

costly  links  in  any  network  equilibrium,  i.e., 


p"  =  1  if  and  only  if  sc{j )  =  sc(i)  for  all  i  such  that  \H™c(ij\  >  k 


(F.l) 


for  k  such  that 


A 


7T 


A 


> 


A 


7T  —  C. 


r  +  A  \  1/p2  +  1/p2 k J  \X  +  r/ 

(a)  Condition  (8.2)  implies  that  for  all  sufficiently  large  n,  we  have 


\n\  >  ?«, 


(F.2) 


where  £  >  0  is  a  constant.  For  any  e  with  0  <  e  <  we  have 


.  2=1 


=  P 


> 


^  1  -  1  -  M„n’e 

E  E 


\  \HU)\<k 


( 


V-  1  -  M”’e 

E  — ->e 


*1  I^V>* 

\ 


n 


yi 


n 


1 


\ 


>  e 


/ 


(F.3) 
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The  right-hand  side  of  the  preceding  inequality  can  be  re-written  as 


m  1  -  AiT 

P  |  2^  - —  >  e 


\ 


‘I  I'Cwli* 


n 


) 


1  - 


=  1  -  P 


^  i  _  Mn,e  ^ 

J2  1  .^L_<e 


VI  \H?c(*)\>k 

E 

<1  l^c(i)l>fc 


n 


m: 


>  r  —  e 


n 


f.  By  Eq.  (F.2),  it  follows  that  for  n  sufficiently  large,  we  have 
r  >  £.  Using  Markov’s  inequality,  the  preceding  relation  implies 


where  r  =  £\, 


P 


1  -  Mn'c 


j 

T 

^  _  n 

V  i^(i)i^fc 


>  e  >  1  - 


n 


r  —  e 


(F.4) 


By  Lemma  8  and  observation  (F.l),  E[A/",f  j  for  an  agent  i  with  >  k  is  upper 

bounded  by 


P(MVe  =  0)  >  erf 


-  erf 


6 


> 


and  therefore 


E[M"’e]  <  1  - 


Using  this  bound  and  assuming  without  loss  of  generality,  that  social  cliques  are  ordered 
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by  size  (Hf  is  the  biggest),  we  can  re-write  Eq.  (F.4)  as 


/ 


P 


E 


1  -  MV 


>  e 


V~i  \HWk 


n 


> 


) 


E:TI^I  1  -  erf 


-  erf 


>  1 - 

(r  —  e)  •  n 

r  •  (1  —  C)  f  ■  (1  —  0 

>  1 - - - ^  >  1  -  ^-4 - —  >  8 


y/WWfl 


r  —  e 


f  -  e 


(F.5) 


Here,  the  second  inequality  is  obtained  since  the  largest  value  for  the  sum  is  achieved  when 
all  summands  are  equal  and 


C  =  erf  (  —tE?  )  -  erf  ' 


\j2p2k 


VWk  ' 


The  third  inequality  holds  using  the  relation  r  >  £  and  choosing  appropriate  values  for  e,  8. 


This  establishes  that  for  all  sufficiently  large  n,  we  have 


WE1^>e]><5>o, 


,  i—  1 


n 


which  implies 


^  1  _  Turn 

lim  sup  P  ^ - -  >  e  )  >  S', 


n—Kx>  \  n 

\  J=1 


and  shows  that  perfect  asymptotic  learning  does  not  occur  in  any  network  equilibrium. 


(b)  We  show  that  if  the  communication  cost  structure  satisfies  condition  (8.3),  then 
asymptotic  learning  occurs  in  all  network  equilibria  ( g,a )  =  {{gn,an})Vi-  For  an  il¬ 
lustration  of  the  resulting  communication  networks,  when  condition  (8.4)  holds,  refer  to 
Figure  8-2 (a).  Let  B™(Gn)  be  the  neighborhood  of  agent  i  in  communication  network  Gn 
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(induced  by  the  link  formation  strategy  gn) 


B™(Gn)  =  {]  there  exists  a  path  V  in  Gn  from  j  to  i}, 


i.e.,  B’l{Gn)  is  the  set  of  agents  in  Gn  whose  information  agent  i  can  acquire  over  a  suffi¬ 
ciently  large  (but  finite)  period  of  time. 


We  first  show  that  for  any  agent  i  such  that  limsup^^  |  H™(l)  |  <  k,  her  neighborhood 
in  any  network  equilibrium  satisfies  limn^oo  | B™\  =  oo.  We  use  the  notion  of  an  isolated 
social  clique  to  show  this.  For  a  given  n,  we  say  that  a  social  clique  H}1  is  isolated  (at  a 
network  equilibrium  ( g ,  a))  if  no  agent  in  ///'  forms  a  costly  link  with  an  agent  outside  H}1 
in  (g,  a).  Equivalently,  a  social  clique  H ™  is  not  isolated  if  there  exists  at  least  one  agent 
j  £  //" .  such  that  j  incurs  cost  c  and  forms  a  link  with  an  agent  outside  H™ . 

We  show  that  for  an  agent  i  with  limsup^^  |  B"c(jj  |  <  k,  the  social  clique  H”c(i}  is  not 
isolated  in  any  network  equilibrium  for  all  sufficiently  large  n.  Using  condition  (8.3),  we 
can  assume  without  loss  of  generality  that  social  cliques  are  ordered  by  size  from  largest 
to  smallest  and  that  limn^>oc  j H ]'  \  =  oo.  Suppose  that  is  isolated  in  a  network 

equilibrium  (g,a).  Then  the  expected  payoff  of  agent  i  is  upper  bounded  (similarly  with 
above) 

E[IU(<f )]  <  max  |tt  -  1/p2  +  1/^2’  r  +  A  “  l/fP  +  l/p*(k-l))  } 


Using  the  definition  of  A:,  it  follows  that  for  some  e  >  0, 


E[IU(3n)]  <  max 


1 

1/p2  +  1/p2  ’ 


(F.6) 


Suppose  next  that  agent  i  forms  a  link  with  an  agent  j  £  HT{ .  Her  expected  payoff 
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E[IIi(gn)]  satisfies 


E[W)]  > 


1/p2  +  1/p2  H? 


-  c, 


since  in  two  time  steps,  she  has  access  to  the  signals  of  all  agents  in  the  social  clique  H™. 
Since  lim^oo  |  Hx\  =  oo,  there  exists  some  Ah  such  that 


E[llj(pn)]  >  ^  ^  ^  7r  —  c  —  e  for  all  n  >  Nx. 

Comparing  this  relation  with  Eq.  (F.6),  we  conclude  that  under  the  assumption  that  r  <  f 
(for  appropriate  fi),  the  social  clique  is  not  isolated  in  any  network  equilibrium  for 

all  n>  N\. 


Next,  we  show  that  lmp,,^^  | B"  |  =  oo  in  any  network  equilibrium.  Assume  to  arrive  at 
a  contradiction  that  lim supn^i0O  \Bf\  <  oo  in  some  network  equilibrium.  This  implies  that 
lim supn_foo  \B?\  <  | ///|  for  all  n  >  N2  >  Nx.  Consider  some  n  >  N2.  Since  H™c{i)  is  not 
isolated,  there  exists  some  j  e  H"c(l)  such  that  j  forms  a  link  with  an  agent  h  outside  Hnsc{{). 
Since  lim supTl_^00  \B?\  <  \H?\,  agent  j  can  improve  her  payoff  by  changing  her  strategy  to 
Ojh  —  0  and  9jh'  ~  1  f°r  h'  £  A  j' ,  i.e. ,  j  is  better  off  deleting  her  existing  costly  link  and 
forming  one  with  an  agent  in  social  clique  H\l.  Hence,  for  any  network  equilibrium,  we 
have 

^lirn  |£C  |  =  oc  for  all  i  with  lim  sup  \H”c(i)  \  <  k  (F.7) 


We  next  consider  the  probability  that  a  non-negligible  fraction  (e-fraction)  of  agents 
takes  an  action  that  is  at  least  e-away  from  optimal  with  probability  at  least  6  along  a 
network  equilibrium  ( g,a ).  For  any  n ,  we  have  from  Markov’s  inequality 


1  -  Aff  ■« 
n 


e 


•E 


e[i  -  Mf’£; 

n 


(F.8) 
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We  next  provide  upper  bounds  on  the  individual  terms  in  the  sum  on  the  right-hand  side. 
For  any  agent  i.  we  have 


E[1  -  <  erf 


v'WWi 


-  erf 


V^P W 


(F.9) 


Consider  an  agent  i  with  limsupn_>.0O  \Hfc^\  <  k  (i-e..  <  k  for  all  n  large).  By 

Eq.  (F.7),  we  have  lim^oo  \Bf  \  =  oo.  Together  with  Eq.  (F.9),  this  implies  that  for  some 
C  >  0,  there  exists  some  N  such  that  for  all  n  >  N ,  we  have 


e[i  -  mi; 


< 


eC 


for  all  i  with 


limsup  |  bf |  <  k. 

n— >-00 


(F10) 


Consider  next  an  agent  i  with  limsupn_+oc  \Hfc^\  >  k ,  and  for  simplicity,  let  us  assume 
that  the  limit  exists,  i.e.,  lim^oo  | |  >  A;.1  This  implies  that  | Hfc^\  >  k  for  all  large 
n,  and  therefore, 


E 


E[1 


Mf’e; 


< 


i  |  limsupn 


n 


< 


where  the  first  inequality  follows  from  Eq.  (F.9).  Using  condition  (8.3),  i.e.,  lim^oo  = 
0,  this  relation  implies  that  there  exists  some  N  such  that  for  all  n  >  N,  we  have 


E 


i\  limsup.^^  \H™c(i)\>k 


E[1  —  M”’e]  ^  eC 
n  <  2  ' 


(F.ll) 


1The  case  when  the  limit  does  not  exist  can  be  proven  by  focusing  on  different  subsequences.  In 
particular,  along  any  subsequence  A/i  such  that  limrj_>0O!TJ6jty'i  l^cWl  >  k,  the  same  argument  holds.  Along 
any  subsequence  A,  with  limn_+0O>ne.A/'i  l-^sC(i)l  <  ^  we  can  use  an  argument  similar  to  the  previous  case 
to  show  that  limn^oo.ne.V,  | \  —  00,  and  therefore  E[1  —  <  ~f  for  n  large  and  n  €  AC 
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Combining  Eqs.  (F.10)  and  (F.ll)  with  Eq.  (F.8),  we  obtain  for  all  n  >  max  {TV,  N}  j 


<c. 


.  i= 1 


n 


where  (  >  0  is  an  arbitrary  scalar.  This  implies  that 


ih" F  ( 5E  - 1  =  o, 


oo  V  z — '  Tl 

,  i—1 


for  all  e,  showing  that  perfect  asymptotic  learning  occurs  along  every  network  equilibrium. 


(c)  The  proof  proceeds  in  two  parts.  First,  we  show  that  if  condition  (8.4)  is  satisfied, 
learning  occurs  in  at  least  one  network  equilibrium  (g,a).  Then,  we  show  that  there 
exists  a  c  >  0,  such  that  if  c  <  c,  then  learning  occurs  in  all  network  equilibria.  We 
complete  the  proof  by  showing  that  if  c  >  c,  then  there  exist  network  equilibria,  in  which 
asymptotic  learning  fails,  even  when  condition  (8.4)  holds.  We  consider  the  case  when 
agents  are  patient,  i.e. ,  the  discount  rate  r  — >  0.  We  consider  k.  such  that  c  > 
and  c  <  i/p2+1jp2^-i)  ~~  e' •  f°r  some  e'  >  0  (such  a  k  exists).  Finally,  we  assume  that 
c  <  l/p2l+i/-p2 ;  since  otherwise  no  agent  would  have  an  incentive  to  form  a  costly  link. 

Part  1:  We  assume,  without  loss  of  generality,  that  social  cliques  are  ordered  by  size 
(Hi  is  the  smallest).  Let  denote  the  set  of  social  cliques  of  size  less  than  k,  i.e., 
=  {H™,i  —  1, . . . ,  Kn  |  | if"  |  <  k}.  Finally,  let  rec(j)  and  send(j)  denote  two  special 
nodes  for  social  clique  if",  the  receiver  and  the  sender  (they  might  be  the  same  node).  We 
claim  that  (gn,<Jn)  described  below  and  depicted  in  Figure  8-2(b)  is  an  equilibrium  of  the 
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network  learning  game  r(Cn)  for  n  large  enough  and  S  sufficiently  close  to  one. 


1  if  sc(i)  =  sc(j ),  i.e. ,  i,j  belong  to  the  same  social  clique, 
1  if  i  =  rec(l  —  1)  and  j  =  send(£ )  for  1  <  £  <  |H^|, 

1  if  i  =  rec(|'H”^|)  and  j  =  send(l), 

0  otherwise 


and  <jn  €  IN FO(Gn),  where  G"  is  the  communication  network  induced  by  gn.  In  this 
communication  network,  social  cliques  with  size  less  than  k  are  organized  in  a  directed 
ring,  and  all  agents  i,  such  that  <  k  have  the  same  neighborhood,  i.e.,  B™  =  Bn  for 

all  such  agents. 

Next,  we  show  that  the  strategy  profile  ( gn ,  an)  described  above  is  indeed  an  equilibrium  of 
the  network  learning  game  T{Cn).  We  restrict  attention  to  large  enough  n’s.  In  particular, 

[HN-\  — 

let  N  be  such  that  I I  >  k  and  consider  any  n  >  N  (such  N  exists  from  condition 

(8.4)).  Moreover,  we  assume  that  the  discount  rate  is  sufficiently  close  to  zero.  We  consider 
the  following  two  cases. 

Case  1:  Agent  i  is  not  a  connector.  Then,  <7”  =  1  if  and  only  if  sc(j )  =  sc(i).  Agent 
i’s  neighborhood  as  noted  above  is  set  Bn ,  which  is  such  that  it  —  >  tt  —  c 

from  the  assumption  on  n,  i.e.,  n  >  N ,  where  N  such  that  I H?  I  >  k.  Agent  i  can 

communicate  with  all  agents  in  Bn  in  at  most  |7-f<^|  communication  steps.  Therefore,  her 
expected  payoff  is  lower-bounded  by 

EltMs")]  >  (xtv)  1  <J  ■  (»  -  1  JfTim)  >7r~c’ 


under  any  equilibrium  an  for  r  sufficiently  close  to  zero.  Agent  i  can  deviate  by  forming 
a  costly  link  with  agent  m,  such  that  sc(m)  ^  sc(i).  However,  this  is  not  profitable  since 
from  above  her  expected  payoff  under  ( gn,on )  is  at  least  tt  —  c  (which  is  the  maximum 
possible  payoff  if  an  agent  chooses  to  form  a  costly  link). 
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Case  2:  Agent  i  is  a  connector,  i.e.,  there  exists  exactly  one  j,  such  that  sc(j)  ^  sc(i )  and 
9?j  =  1-  Using  a  similar  argument  as  above  we  can  show  that  it  is  not  profitable  for  agent 
i  to  form  an  additional  costly  link  with  an  agent  m,  such  that  sc(m )  ^  sc{i).  On  the  other 
hand,  agent  %  could  deviate  by  setting  gV-  =  0.  However,  then  her  expected  payoff  would 
be 


E[ni(5n)]  =  max 

<  max  /  7r 


A 


l/p2  +  l/p2’r  +  A 

1 _ A _ 

1/p2  +  1/p2  ’  r  +  A 


7T  — 


7 T 


1/p2  +  l/p2\H? 

_ 1 _ 

1/p2  +  l/p2(k  -  1) 


(F.  12) 


<7 r  —  c  —  e 


< 


A 


r  +  A 


n 


<k. 


IT  — 


i/p2  +  i/p2\Br- 


—  c  —  e, 


for  discount  rate  sufficiently  close  to  zero.  Therefore  deleting  the  costly  link  is  not  a 
profitable  deviation.  Similarly  we  can  show  that  it  a  (weakly)  dominant  strategy  for  the 
connector  not  to  replace  her  costly  link  with  another  costly  link. 

We  showed  that  ( gn,an )  is  an  equilibrium  of  the  network  learning  game.  Note  that 
we  described  a  link  formation  strategy,  in  which  social  cliques  connect  to  each  other  in  a 
specific  order  (in  increasing  size).  There  is  nothing  special  about  this  ordering  and  any 
permutation  of  the  first  \H^\  cliques  is  an  equilibrium  as  long  as  they  form  a  directed 
ring.  Finally,  any  node  in  a  social  clique  can  be  a  receiver  or  a  sender. 

Next,  we  argue  that  asymptotic  learning  occurs  in  network  equilibria  (g,  a)  =  {(<?”,  <rri)}^=1, 
where  for  all  n  >  N,  N  is  a  large  constant,  gn  has  the  form  described  above.  As  shown 
above,  all  agents  i  for  which  <  k  have  the  same  neighborhood,  which  we  denoted 

by  Bn.  Moreover,  lim,,^^  \Bn\  =  oo,  since  all  social  cliques  with  size  less  than  k  are  con¬ 
nected  to  the  ring  and,  by  condition  (8.4),  lim^oo  )T\|  |Jf„|<jt  \H/\  =  oo.  For  discount  rate 
r  sufficiently  close  to  zero  and  from  arguments  similar  to  those  in  the  proof  of  part  (b),  we 
conclude  that  asymptotic  learning  occurs  in  network  equilibria  (g,  a). 

Part  2:  We  have  shown  a  particular  form  of  network  equilibria,  in  which  asymptotic  learn- 
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HI  •-*#—  • 

I 

H£ 

(a)  Deviation  for  i  G  H\\  -  property  (i).  (b)  Deviation  for  i  e  H}1  -  property  (ii). 

Figure  F-l:  Communication  networks  under  condition  (8.4). 

ing  occurs.  The  following  proposition  states  that  for  discount  rate  sufficiently  close  to  zeio 
network  equilibria  fall  in  one  of  two  forms. 

Proposition  28.  Let  Assumptions  6,  8  and  condition  (8.\)  hold.  Then,  an  equilibrium 
( gn,on )  of  the  network  learning  game  r(Cn)  can  be  in  one  of  the  following  two  forms. 

(i)  (Incomplete)  Ring  Equilibrium:  Social  cliques  with  indices  {1,  •  ■  •  ,j},  where 

j  <  \Hn<Jc\,  form  a  directed  ring  as  described  in  Part  1  and  the  rest  of  the  social 
cliques  are  isolated.  We  call  those  equilibria  ring  equilibria  and,  in  particular,  a  ring 
equilibrium  is  called  complete  if  j  =  he.,  if  all  social  cliques  with  size  less  than 

k  are  not  isolated. 

(ii)  Directed  Line  Equilibrium:  Social  cliques  with  indices  {1,  •  •  *  ,j},  where  j  < 
\Hn-k\,  and  clique  with  index  \H^n\  (the  largest  clique)  form  a  directed  line  with  the 
latter  being  the  endpoint.  The  rest  of  the  social  cliques  are  isolated. 

Proof.  Let  ( gn,an )  be  an  equilibrium  of  the  network  learning  game  r(Cn).  Monotonicity 
of  the  expected  payoff  as  a  function  of  the  number  of  signals  observed  implies  that  if  clique 
is  not  isolated,  then  no  clique  with  index  less  than  t  is  isolated  in  the  communication 
network  induced  by  g11.  In  particular,  let  conn(£)  be  the  connector  of  social  clique  Hf 
and  E[nconn(f)(#n)]  be  her  expected  payoff.  Consider  an  agent  i  such  that  sc(i)  =  H  <  t 
and,  for  the  sake  of  contradiction,  Hf,  is  isolated  in  the  communication  network  induced 
by  gn.  Social  cliques  are  ordered  by  size,  therefore,  \Hf,\  <  \Hf\.  At  this  point,  we  use  the 
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monotonicity  mentioned  above.  Consider  the  expected  payoff  of  agent  i: 


E[n;  ((?"•)]  =  max  1 71-  — 
<  max  <f  7r  — 
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A 
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1/p2  +  1/p2’  A  +  r 

1  A 


1/p2  +  1/p2  ’  A  + 


7T 


7T  — 


1/p2  +  l/p2\H?,\ 

1 

i/p2  +  i/p2|if;| 


<  E[nconn(^(pn)], 

(F.13) 


where  the  last  inequality  follows  from  the  fact  that  agent  conn(£ )  formed  a  costly  link. 
Consider  a  deviation,  g^deviation  for  agent  ;  in  which  qn'devi?*on  =  1  and  qGdeviation  =  Qn 

u  1  0  3  y%,conn\t)  Vij  Vij  ? 

i.e.,  agent  i  forms  a  costly  link  with  agent  conn{£).  Then, 

ml(9n'iem“,on)}  >  jA-  E[ncm„,n(s»)]  >  Eplits”)], 


from  (F.13)  and  for  discount  rate  sufficiently  close  to  zero.  Therefore,  social  clique  H$  will 
not  be  isolated  in  any  network  equilibrium  (gn,  an). 

Next,  we  show  two  structural  properties  that  all  network  equilibria  ( gn,an )  should 
satisfy,  when  the  discount  rate  r  is  sufficiently  close  to  one.  We  say  that  there  exists  a  path  V 
between  social  cliques  and  Hg,  if  there  exists  a  path  between  some  i  6  H]/  and  j  e  H?  . 
Also,  we  say  that  the  in-degree  (out-degree)  of  social  clique  II}/  is  k,  if  the  sum  of  in- links 
(out-links)  of  the  nodes  in  H}/  is  k,  i.e.,  H}/  has  in-degree  k  if  Y.Huy  9%  =  k. 

(i)  Let  H}/ ,  H}/  be  two  social  cliques  that  are  not  isolated.  Then,  there  should  exist  a 
directed  path  V  in  G"  induced  by  gn  between  the  two  social  cliques. 

(ii)  The  in-degree  and  out-degree  of  each  social  clique  is  at  most  one. 

Figure  F-l  provides  an  illustration  of  why  the  properties  hold  for  patient  agents.  In  par¬ 
ticular,  for  property  (i),  let  i  =  conn(H £)  and  j  =  conn{H/2)  and  assume,  without  loss 
of  generality,  that  \D}1\  <  \B/\.  Then,  for  discount  rate  sufficiently  close  to  zero  and  from 
monotonicity  of  the  expected  payoff,  we  conclude  that  i  has  an  incentive  to  deviate,  delete 
her  costly  and  form  a  costly  link  with  agent  j.  Property  (ii)  follows  due  to  similar  argu- 
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merits. 


From  the  above,  we  conclude  that  the  only  two  potential  equilibrium  topologies  are  the 
(incomplete)  ring  and  the  directed  line  with  the  largest  clique  being  the  endpoint  under 
the  assumptions  of  the  proposition.  ■ 

So  far  we  have  shown  a  particular  form  of  network  equilibria,  that  arise  under  condition 
(8.4),  in  which  asymptotic  learning  occurs.  We  also  argued  that  under  condition  (8.4)  only 
(incomplete)  ring  or  directed  line  equilibria  can  arise  for  network  learning  game  T(Cn).  In 
the  remainder  we  show  that  there  exists  a  bound  c  >  0  on  the  common  cost  c  for  forming 
a  link  between  two  social  cliques,  such  that  if  c  <  c  all  network  equilibria  (g.  a)  that  arise 
satisfy  that  gn  is  a  complete  ring  equilibrium  for  all  n  >  N,  where  N  is  a  constant.  In  those 
network  equilibria  asymptotic  learning  occurs  as  argued  in  Part  1.  On  the  other  hand,  if 
c  >  c  coordination  among  the  social  cliques  may  fail  and  additional  equilibria  arise  in  which 
asymptotic  learning  does  not  occur.  Let 

n  •  (  1  1 

c  =  mm  < - - 1 - — - - ,  - - — 

*  1  i/p2  +  i/p2(E‘,,|ff"l  +  W+1l)  i/p2  +  i/P2|tf?+il) 

where  kx  <  k  <  j and  YljLi  I Hj  I  >  WkA  (size  °f  the  largest  social  clique).  Moreover, 
let 

c  =  lim  inf  cn. 

n—¥  o© 

Then, 

Proposition  29.  Let  Assumptions  6,  8  and  condition  (8.4)  hold.  If  c  <  c  asymptotic 
learning  occurs  in  all  network  equilibria  ( g,a ).  Otherwise,  there  exist  equilibria  in  which 
asymptotic  learning  does  not  occur. 

Proof.  Let  the  common  cost  c  be  such  that  c  <  c,  where  c  is  defined  as  above,  and 
consider  a  network  equilibrium  (g,  a).  Let  fV  be  a  large  enough  constant  and  consider  the 
corresponding  gn  for  n  >  N.  We  claim  that  gn  is  a  complete  ring  equilibrium  for  all  such  n. 
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Assume  for  the  sake  of  contradiction  that  the  claim  is  not  true.  Then,  from  Proposition  28, 
gn  is  either  an  incomplete  ring  equilibrium  or  a  directed  line  equilibrium.  We  consider  the 
former  case  (the  latter  case  can  be  shown  with  similar  arguments).  There  exists  an  isolated 
social  clique  id",  such  that  |fd”|  <  k  and  all  cliques  with  index  less  than  £  are  not  isolated 
and  belong  to  the  incomplete  ring.  However,  from  the  definition  of  c  we  obtain  that  an 


agent  i  €  Hf  would  have  an  incentive  to  connect  to  the  incomplete  ring,  thus  we  reach 
a  contradiction.  In  particular,  consider  the  following  link  formation  strategy  for  agent  v. 


n, deviation 
Mim 


=  1  for  agent  m  €  and  g™’dematlon 


p”  for  j  7 -  m.  Then, 


m?(g 


n  /  n, deviations 


A 

A  +  r  J 


i /p2 + \H"\  +  ih?d 


>  max  <  7r  — 


1/p2  +  1/p2  ’  A  +  r  V  1/p2  +  l/p2\H\ 


—  c 

}  =  E[n"(jn)], 


where  the  strict  inequality  follows  from  the  definition  of  c  for  r  sufficiently  close  to  zero. 
Thus,  we  conclude  that  if  c  <  c.  gn  is  a  complete  ring  for  all  n  >  N,  where  A  is  a  large 
constant,  and  from  Part  1  asymptotic  learning  occurs  in  all  network  equilibria  (p,  a). 

On  the  contrary,  if  c  >  c,  then  there  exists  an  infinite  index  set  W,  such  that  for  all  n 
in  the  (infinite)  subsequence,  {nw}wew,  there  exists  a  k,  such  that 


i/p2  +  i/p2(Et,  \m\ + \m 


—  c  < 


i/p2  +  i/p2ii/;?+1r 


(F.15) 


Moreover,  \H£+1\  <  k  and  \Hj\  —  \^kA-  We  conclude  that  for  (F.15)  to  hold  it 

has  to  be  that  X^=i  l-^/l  <  R,  where  R  is  a  uniform  constant  for  all  n  in  the  subsequence. 
Consider  ( g such  that  for  every  n  in  the  subsequence,  gn  is  such  that  social  cliques 
with  index  greater  than  k  (as  described  above)  are  isolated  and  the  rest  form  an  incomplete 
ring  or  a  directed  line  and  an  —  INFO(Gn),  where  Gn  is  the  communication  network  in¬ 
duced  by  gn.  From  above,  we  obtain  that  for  c  >  c,  (pn,  an)  is  an  equilibrium  of  the  network 
learning  game  T(Cn).  Perfect  assymptotic  learning,  however,  fails  in  such  an  equilibrium, 
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since  for  every  i  £  Nn ,  \B™\  <  R,  where  recall  that  Bf  denotes  the  neighborhood  of  agent 
i.  m 

Proposition  29  concludes  the  proof  of  Theorem  3. 
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